WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
C12Q 1/68, C07K 14/47 



A2 



(11) International Publication Number: WO 99/02735 

(43) International Publication Date: 21 January 1999 (21.01.99) 



(21) International Application Number: PCT/US98/ 14354 

(22) International Filing Date: 10 July 1998 (10.07.98) 



(30) Priority Data: 

08/890,979 
09/031,626 



10 July 1997 (10.07.97) US 
27 February 1998 (27.02.98) US 



(71) Applicants: MILLENNIUM PHARMACEUTICALS, INC. 

[US/US]; 238 Main Street, Cambridge, MA 02142 (US). 
TRUSTEES OF TUFTS UNIVERSITY [US/US]; Medford, 
MA 02155 (US). 

(72) Inventors: ACTON, Susan, L.; 42 Munroe Road, Lexington, 

MA 02173 (US). ORDOVAS, Jose, M.; 6 Clearview Drive, 
Framingham, MA 01701 (US). 

(74) Agents: ARNOLD, Beth, E. et al.; Foley, Hoag & Eliot LLP, 
One Post Office Square, Boston, MA 02109 (US). 



(81) Designated States: AU, CA, JP, European patent (AT, BE, 
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: DIAGNOSTIC ASSAYS AND KITS FOR BODY MASS AND CARDIOVASCULAR DISORDERS 



(57) Abstract 



The present invention is based at least in part on the discovery of the genomic structure of the human SR-BI gene and on the 
identification of polymorphic regions within the gene. Accordingly, the invention provides nucleic acids having a nucleotide sequence of an 
allelic variant of an SR-BI gene and nucleic acids having an SR-BI intronic sequence. The invention also provides methods for identifying 
specific alleles of polymorphic regions of an SR-BI gene, methods for determining whether a subject has or is at risk of developing a 
disease which is associated with a specific allele of a polymorphic region of an SR-BI gene, and kits for performing such methods. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United Slates of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/02735 



PCT/US98/14354 



- 1 - 



Diagnostic Assays and Kits for Body Mass and 
Cardiovascular Disorders 

5 

1. Background of the Invention 

Coronary heart disease is a major health risk throughout the industrialized world. 
Atherosclerosis, the most prevalent of cardiovascular diseases, is the principal cause of 
heart attack, stroke, and gangrene of the extremities, and thereby the principle cause of 

1 0 death in the United States. Although historically much emphasis has been placed on total 
plasma cholesterol levels as a risk factor for coronary heart disease, it has been clearly 
established that low levels of high density lipoprotein cholesterol (HDL-C) is an 
independent risk factor for this disease. Family and twin studies have shown that there are 
genetic components that affect HDL levels. However, mutations in the main protein 

15 components of HDL (ApoAl and ApoAII) and in the enzymes that are known to be 
involved in HDL metabolism (e.g., CETP, HL, LPL and LCAT) do not explain all of the 
genetic factors affecting HDL levels in the general population (J. L. Breslow, in The 
Metabolic and Molecular Bases of Inherited Disease, C.R. Scriver, A.L. Beaudet, W. Sly, 
D. Valle, Eds. (McGraw-Hill, New York, 1995), pp 2031-2052; and S.M. Grundy, (1995) 

20 1 Am. Med. Assoc. 256: 2849). This finding in combination with the fact that the 
mechanisms of HDL metabolism are poorly understood, suggests that there are other as yet 
unknown factors that contribute to the genetic variability of HDL levels. 

Another disorder that is often associated with high triglyceride and low high density 
lipoprotein (HDL) concentrations is obesity, which renders a subject susceptible to 

25 cardiovascular diseases, such as ischemia, restenosis, congestive heart failure, and 
atherosclerosis. Severely obese individuals (weighing 60% over a normal weight) have a 
high risk of developing cardiorespiratory problems. They are also at risk of developing 
chronic hypoventilation, which can lead to hypercapnia, pulmonary hypertension, and heart 
failure. Severe episodic hypoxia, which can cause arrhythmias and sudden death, is 10 

30 times more common in the severely obese. Severely obese individuals are also at increased 
risk of suffering from obstructive sleep apnea, pickwickian syndrome (i.e., daytime 
hypoventilation, somnolence, polycythemia, cor pulmonale), and renal vein thrombosis. 
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("Cecil Essentials of Medicine", Andreoli et al., Third Edition, 1993, W.B. Saunders 
Company). 

Moderate obesity (corresponding to a weight between 20-60% above normal 
weight) poses increased risk of early mortality. Obese individuals suffer more frequently 
than non obese individuals from hypertension. Type II diabetes mellitus can also be 
aggravated by excess weight. Obesity can also increase the risk of a subject developing 
cholelithiasis and endometrial carcinoma. 

One candidate factor that is likely to be involved both in obesity and cardiovascular 
disease is the SR-BI receptor, which has been shown to bind HDL and LDL cholesterol and 
mediate uptake into cells (Acton, S. et al, (1996) Science 277:518-520). SR-BI is likely 
to contribute to genetic lipoprotein variability, thereby playing a role in the development 
of lipid metabolism disorders and thus generally, cardiovascular diseases. 

In addition, cholesterol gallstone formation could be caused by a defective SR-BI 
receptor, since the SR-BI receptor is likely to be involved in transferring HDL-cholesterol 
from extrahepatic tissues to the liver (reverse cholesterol transport) e.g. for incorporation 
into bile (J. L. Breslow, in The Metabolic and Molecular Bases of Inherited Disease, C.R. 
Scriver, A.L. Beaudet, W. Sly, D. Valle, Eds. (McGraw-Hill, New York, 1995), pp 2031- 
2052; S.M. Grundy, (1995) J. Am. Med Assoc. 256: 2849; G. Assman, A. von Eckardstein, 
H.B. Brewer Jr. in The Metabolic and Molecular Bases of Inherited Disease, C.R. Scriver, 
A.L. Beaudet, W. Sly, D. Valle, Eds. (McGraw-Hill, New York, 1995), pp 2053-2072; WJ. 
Johnson et al., (1991) Biochem. Biophys. Acta 1085:273; M.N. Pieters et al., (1994) Ibid 
7225:125; and C. J. Fielding and P. E. Fielding, (1995) J. Lipid Res 56:211). 

Further, a defective SR-BI receptor or abnormal levels of SR-BI receptor could 
influence the fertility of a subject, since SR-BI appears to be involved in HDL-cholesteryl 
ester delivery to steroidogenic tissues (ovary, adrenal glands and testis) for hormone 
synthesis (Acton, S. et al., (1 996) Science 277:518-520; Landschulz, et al. 5 (1996)7. Clin. 
Invest. 95/984-95; J. M. Anderson and J. M. Dietschy (1981) J. Biol. Chem. 256: 7362; 
M.S. Brown et al., (1979) Recent Prog Horm. Res. 55:215; J.T. Gwynne and J.F. Strauss 
III, (1982) Endocr. Rev. 3:299; B.D. Murphy et al., (1985) Endocrinology 116: 1587). 

The SR-BI receptor (Scavenger Receptor-BI) is a scavenger receptor that mediates 
endocytosis of unmodified and modified lipoproteins, e.g., LDL, acetylated LDL, oxidized 
LDL (Acton et al. (1994) J. Biol. Chem. 269:21003), HDL ((Acton, S. et al., (1996) Science 
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277:518-520), anionic phospholipids (Rigotti et al. (1995) J. Biol. Chem. 270:16221), 
negatively charged liposomes and apoptotic cells (Fukasawa et al. (1996) Exp. Cell Res. 
222:246). The human SR-BI receptor (also termed "CLA-1) exists in two differentially 
spliced forms (Calvo and Vega (1993) J. Biol. Chem. 268: 18929). The predominant form 

5 of human SR-BI is a protein of 509 amino acids. The shorter form of the SR-BI receptor 
has 409 amino acids, and is lacking the 100 amino acids located 42 amino acids 
. downstream of the initiation codon (Calvo and Vega, supra). The nucleotide sequence of 
a cDNA encoding human SR-BI is disclosed in Calvo and Vega, supra) and the nucleotide 
sequence of a cDNA encoding hamster SR-BI is disclosed in Acton et al. (1994) J. Biol. 

10 Chem. 269:21003 and in PCT Application WO 96/00288. 

2. Summary of the Invention 

The present invention is based at least in part on the discovery of the genomic 
structure of the human SR-BI gene and on the identification of polymorphic regions within 

1 5 the gene, which are associated with specific diseases or disorders, including abnormal body 
mass and abnormal lipoprotein levels, i.e., high HDL and low LDL levels. The human SR- 
BI gene contains 12 coding exons and one non coding exon (exon 13). The structure of the 
gene and the position of the introns relative to the nucleotide sequence of the exons are 
shown in Figures 1, 2, and 3. 

20 Several polymorphic regions that are associated with specific diseases or disorders, 

have been found in the human SR-BI gene by analyzing the DNA of a specific population 
of individuals. One polymorphism found in the population is a change from a guanine to 
an adenine at position 1 46 in exon 1 , which results in a change from a glycine to a serine 
at amino acid residue 2 of the encoded protein. A second polymorphism is a change from 

25 a guanine to an adenine at position 1 19 in exon 3, which results in a change from a valine 
to an isoleucine at amino acid residue 135 of the encoded protein. A third polymorphism 
is a change from a cytidine to a thymidine at position 41 of exon 8, which does not result 
in a difference in the amino acid sequence of the encoded protein. A fourth polymorphism 
is a change from a cytidine to a thymidine at position 56 of intron 5. A fifth polymorphism 

30 is a change from a cytidine to a guanine at position -41 of intron 10 (position -1 
corresponds to the first nucleotide upstream of exon 11). 
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Specific allelic variants of these polymorphic regions are shown herein to be 
associated with specific disorders. In particular, the presence of a thymidine at position 41 
in exon 8 was found to be associated with low plasma LDL levels in women. Furthermore, 
a thymidine at position 54 of intron 5 was found to be associated with a high BMI and high 

5 plasma LDL levels in women. In men, the presence of a thymidine at position 41 in exon 
8, a thymidine at position 54 of intron 5 and/or an adenine at position 146 of exon 1 was 
found to be associated with a high plasma HDL level. Since abnormal lipid, lipoprotein 
levels, and BMI may be associated with obesity, cachexia, cardiovascular disease, gallstone 
formation and other disorders. SR-BI polymorphic variants are directly or indirectly 

10 associated with obesity, cachexia, cardiovascular disease, gallstone formation and other 
disorders. . 

In one embodiment, the invention provides isolated nucleic acids comprising an 
intronic sequence from an SR-BI gene. In a preferred embodiment, the SR-BI gene is a 
human gene. In another preferred embodiment, the nucleic acid of the invention has a 
1 5 nucleotide sequence set forth in Figure 2 or in any of the intronic sequences in SEQ ID Nos. 
1-121, complements thereof, or homologs thereof. In yet another embodiment, the intronic 
sequence of the nucleic acid is capable of hybridizing under an appropriate stringency to 
a nucleic acid having an intronic nucleotide sequence set forth in any of SEQ ID Nos. 1-121 
or complements thereof. 

20 Other preferred nucleic acids of the invention comprise at least an allelic variant of 

a polymorphic region. A preferred allele has a polymorphic region that is located in an 
exon and comprises, e.g., a nucleotide sequence set forth in SEQ ID NO: 65, SEQ ID NO: 
95, or SEQ ID NO:96 or a polymorphic region that is located in an intron and comprises, 
e.g., a nucleotide sequence set forth in SEQ ID NO: 66 or SEQ ID NO: 97. The isolated 

25 nucleic acid preferably comprises from about 15 to about 30 nucleotides and can comprise, 
e.g., a nucleotide sequence selected from the group consisting of SEQ ID NO: 41 to SEQ 
ID NO: 64 and SEQ ID NO: 89 to SEQ ID NO: 94. The isolated nucleic acid can be double 
stranded or single stranded and can further comprise a label. 

The nucleic acids of the invention can be used, e.g., in prognostic, diagnostic, and 

30 therapeutic methods. For example, the nucleic acids of the invention can be used as probes 
or primers to determine whether a subject has or is at risk of developing a disease or 
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disorder associated with a specific allelic variant of an SR-BI polymorphism, e.g., a disease 
or disorder associated with an aberrant SR-BI activity. 

The invention further describes vectors comprised of the claimed nucleic acids; host 
cells transfected with said vectors whether prokaryotic or eukaryotic; and transgenic non- 
human animals which contain a heterologous form of a functional or non-functional SR-BI 
allele described herein. Such a transgenic animal can serve as an animal model for 
studying, e.g., the effect of specific allelic variations, including mutations of an SR-BI gene 
or for use in drug screening or recombinant protein production. 

The invention further provides methods for determining the molecular structure of 
at least a portion of an SR-BI gene. In a preferred embodiment, the method comprises 
contacting a sample nucleic acid comprising an SR-BI gene sequence with a probe or 
primer having a sequence which is complementary to an SR-BI gene sequence and 
comparing the molecular structure of the sample nucleic acid with the molecular structure 
of a control (known) SR-BI gene (e.g., an SR-BI gene from a human not afflicted with a 
cardiovascular condition or a disease associated with an aberrant SR-BI activity). The 
method of the invention can be used for example in determining the molecular structure of 
at least a portion of an exon, an intron, or the promoter. In a preferred embodiment, the 
method comprises determining the identity of at least one nucleotide. In even more 
preferred embodiments, the nucleotide is nucleotide 146 of exon 1, nucleotide 119 of exon 
3, nucleotide 41 of exon 8. nucleotide 54 of intron 5, and/or nucleotide -41 of intron 10. 
In another preferred embodiment, the method comprises determining the nucleotide content 
of at least a portion of an SR-BI gene, such as by sequence analysis. In yet another 
embodiment, determining the molecular structure of at least a portion of an SR-BI gene is 
carried out by single-stranded conformation polymorphism. In yet another embodiment, 
the method is an oligonucleotide ligation assay. Other methods within the scope of the 
invention for determining the molecular structure of at least a portion of an SR-BI gene 
include hybridization of allele-specific oligonucleotides, sequence specific amplification, 
and primer specific extension. 

In at least some of the methods of the invention, the probe or primer is allele 
specific. Preferred probes or primers are single stranded nucleic acids, which optionally 
are labeled. 
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The methods of the invention can be used for determining the identity of the allelic 
variant of a polymorphic region of a human SR-BI gene present in a subject. For example, 
the method of the invention can be useful for determining whether a subject has, or is at risk 
of developing, a disease or condition associated with a specific allelic variant of a 
5 polymorphic region in the human SR-BI gene. In one embodiment, the disease or condition 
is characterized by an aberrant SR-BI activity, such as an aberrant SR-BI protein level, 
which can result from an aberrant expression of an SR-BI gene. The disease or condition 
can be an abnormal lipid metabolism, inappropriate lipid or lipoprotein level, an abnormal 
body mass index, atherosclerosis, or gallstone formation. Accordingly, the invention 

10 provides methods for predicting or diagnosing cardiovascular diseases and other diseases 
associated with an aberrant SR-BI activity. 

For example, a female subject having the more common allele (a cytidine) at residue 
41 of exon 8 of SR-BI has or is likely to have a tendency of having higher LDL levels than 
a female subject having a thymidine at that position, thereby being at a higher risk of 

15 developing a cardiovascular disease; a female subject having the less common allele (a 
thymidine) at residue 54 of intron 5 has or is likely to develop a high BMI and/or high LDL 
levels, relative to a female subject having a cytidine at that position; and a male subject 
having the more common allele at residue 41 of exon 8, the more common allele at residue 
54 of intron 5, and the more common allele at residue 146 of exon 1 is likely to have or to 

20 develop lower HDL levels relative to a subject male having the less common alleles of at 
least one of these polymorphic regions, thereby being at a higher risk of developing a 
cardiovascular disease. 

The methods of the invention can also be used in selecting the appropriate drug to 
administer to a subject to treat a disease or condition, such as an abnormal lipid 

25 metabolism, inappropriate lipid level, a cardiovascular disease such as atherosclerosis, 
gallstone formation, or an abnormal body mass index. In fact, specific allelic variants of 
SR-BI polymorphic regions may be associated with a specific response of an individual 
having such an allele to a specific drug. For example, a specific SR-BI allele may encode 
an SR-BI protein having a modified affinity for certain types of molecules, e.g, lipids. 

30 Accordingly, the action of a drug necessitating interaction with an SR-BI protein will be 
different in individuals carrying such an SR-BI allele. 
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In a further embodiment, the invention provides a method for treating a subject 
having a disease or condition associated with a specific allelic variant of a polymorphic 
region of an SR-BI gene. In one embodiment the method comprises (a) determining the 
identity of the allelic variant; and (b) administering to the subject a compound that 

5 compensates for the effect of the specific allelic variant. In a preferred embodiment, the 
specific allelic variant is a mutation. The mutation can be located, e.g., in a promoter 
region, an intron, or an exon of the gene. In one embodiment, the compound modulates 
(i.e., agonizes or antagonizes) SR-BI protein levels. In a preferred embodiment, the 
compound is selected from the group consisting of a nucleic acid, a protein, a 

10 peptidomimetic, or a small molecule. The compound can be, an for example, SR-BI 
protein. Thus, e.g., if a female subject has the more common allele of residue 41 of exon 
8, high LDL levels and resulting cardiovascular disorders can be prevented from occurring 
or can be reduced, by administering to the subject a pharmaceutical^ effective amount of 
a compound which reduces LDL level to a normal LDL level. Similarly, if a female subject 

15 has the less common allele of residue 54 of intron 5, a high BMI and/or LDL level and 
consequences thereof, such as diabetes and cardiovascular disorders, can be prevented from 
occurring or can be reduced, by administering to the subject a pharmaceutical^ effective 
amount of a compound which reduces the BMI and/or the LDL levels. If, on the other 
hand, a male subject has the more common allele at residue 4 1 of exon 8, the more common 

20 allele at residue 54 of intron 5, and the more common allele at residue 146 of exon 1, 
development of low HDL levels can be prevented or increased by administering to the 
subject a pharmaceutically effective amount of a compound that increases HDL levels. 

The invention also provides probes and primers comprising substantially purified 
oligonucleotides, which correspond to a region of nucleotide sequence which hybridizes 

25 to at least 6 consecutive nucleotides of the sequence set forth as SEQ ID Nos: 1, 2, or 3 or 
to the complement of the sequences set forth as SEQ ID Nos: 1, 2, or 3; or naturally 
occurring mutants thereof. In preferred embodiments, the probe/primer further includes a 
label group attached thereto, which is capable of being detected. 

In another embodiment, the invention provides a kit for amplifying and/or for 

30 determining the molecular structure of at least a portion of an SR-BI gene, comprising a 
probe or primer capable of hybridizing to an SR-BI gene and instructions for use. In one 
embodiment, the probe or primer is capable of hybridizing to an SR-BI intron. In another 
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embodiment, the probe or primer is capable of hybridizing to an allelic variant of a 
polymorphic region. In a preferred embodiment, the polymorphic region is located in an 
exon, such as exon 1, 3, or 8 or in an intron, such as intron 5 or 10. In a preferred 
embodiment, determining the molecular structure of a region of an SR-BI gene comprises 

5 determining the identity of the allelic variant of the polymorphic region. Determining the 
molecular structure of at least a portion of an SR-BI gene can comprise determining the 
identity of at least one nucleotide or determining the nucleotide composition, e.g., the 
nucleotide sequence. 

A kit of the invention can be used, e.g., for determining whether a subject has or is 

10 at risk of developing a disease associated with a specific allelic variant of a polymorphic 
region of an SR-BI gene. In a preferred embodiment, the invention provides a kit for 
determining whether a subject has or is at risk of developing a disease or condition 
associated with abnormal lipid metabolism, inappropriate lipid or lipoprotein levels, a 
cardiovascular disease such as atherosclerosis, gallstone formation, or an abnormal body 

15 mass index. The disease or condition can be associated with an aberrant SR-BI activity, 
which can result, e.g., from a mutation in the SR-BI gene. The kit of the invention can also 
be used in selecting the appropriate drug to administer to a subject to treat a disease or 
condition, such as a disease or condition set forth above. In fact, pharmacogenetic studies 
have shown that the genetic background of individuals play a role in determining the 

20 response of an individual to a specific drug. Thus, determining the allelic variants of SR-BI 
polymorphic regions of an individual can be useful in predicting how an individual will 
respond to a specific drug, e.g, a drug for treating a disease or disorder associated with an 
aberrant SR-BI activity and/or a cardiovascular disease or a disease associated with an 
aberrant lipid level. 

25 Other features and advantages of the invention will be apparent from the following 

detailed description and claims. 

3. Brief Desc ription of the Figures 

Figure I is a schematic depiction of the chromosomal structure of the human SR-BI 
30 gene indicating the introns (1 through 12) and exons (I-XIII). Black boxes represent coding 
exons (exons I-XII) and the white box represents the non-coding exon (exon XIII) boxed 
and the nucleotides in the newly identified alleles are indicated. 
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Figure 2 represents the nucleotide sequence of the exons (underlined sequence) of 
the human SR-BI gene, portions of the introns which are adjacent to the exons, and 3' 
promoter sequence (SEQ ID Nos. 5-40). The putative 5' end of the cDNA, as predicted by 
GRAIL is indicated in italics. The TATA-like box is indicated in italics and is boxed. Bold 
5 sequences correspond to the nucleotide sequence or the complement of the nucleotide 
sequence of preferred primers for amplifying each of the exons or a promoter region. The 
nucleotide polymorphisms in exons 1,3, and 8 and introns 5 and 10 are boxed. 

Figure 3 shows the nucleotide sequence of the full length human SR-BI cDNA 
(SEQ ID NO: 1) and the position of introns 1-12 relative to the nucleotide sequence of the 
10 exons. The nucleotide polymorphisms in exons 1,3. and 8 are boxed. 

Figure 4 is a graphic representing the mean LDL-C differences (+/- 95% 
Confidence intervals) between SR-BI genotypes carrying variant alleles and the wild-type 
genotype (1 1 1/1 1 1) in women. * significantly different from 111/111 (p<0.030). The 
differences between genotypes sharing letters are statistically significant (a: p=0.001; b: 
15 p=0.016;c:p=0.004). 

Figure 5 is a graphic representing the mean LDL-C differences (+/- 95% 
Confidence intervals) between SR-BI genotypes carrying variant alleles and the wild-type 
genotype (111/111) in men. *significantly different from 111/111 (p<0.030). The 
differences between genotypes sharing letters are statistically significant (a: p=0.001; b: 
20 p=0.016;c:p=0.004). 

Figure 6 is a graphic representing mean HDL-C differences (+/- 95% Confidence 
intervals) between SR-BI genotypes carrying variant alleles and the wild-type genotype 
(1 1 1/1 11) in women. 

Figure 7 is a graphic representing mean HDL-C differences (+/- 95% Confidence 
25 intervals) between SR-BI genotypes carrying variant alleles and the wild-type genotype 
(1 1 1/1 11) in men. *significantly different from 1 1 1/1 1 l(p<0.040). 

Figure 8 is a graphic representing mean BMI differences (+/- 95% Confidence 
intervals) between SR-BI genotypes carrying variant alleles and the wild-type genotype 
(1 1 l/l 1 1) in women. * significantly different from 111/111 (p=0.020). The differences 
30 between genotypes sharing letters are statistically significant (a: p=0.007; b: p=0.005; c: 
p=0.004). 
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Figure 9 is a graphic representing mean BMI differences (+/- 95% Confidence 
intervals) between SR-BI genotypes carrying variant alleles and the wild-type genotype 
(11 1/1 11) in men. 

5 4. Detailed Description of the Invention 

4.1 1 Qenerel 

The present invention is based at least in part on the discovery of the genomic 
structure of the human SR-BI gene and on the identification of polymorphic regions within 

1 0 the gene which correlate with specific diseases or conditions, such as an abnormal BMI or 
an abnormal lipoprotein (i.e., HDL or LDL) level. 

As shown in Figure 1 , the human SR-BI gene is at least 50 kilobase pairs long and 
has 12 coding exons, one non-coding exon (exon 13), and 12 introns. The exons are 
numbered 1 to 13 from 5' to 3' and the introns are labeled 1 through 12 from 5' to 3'. Exon 

15 1 corresponds to the first exon located downstream of the promoter and contains the 
initiation codon. Intron 1 is located immediately downstream of exon I (see Figure 1). The 
position of the introns relative to the nucleotide sequence of the full length cDNA encoding 
SR-BI is shown in Figure 2. The nucleotide sequence of the human SR-BI cDNA, shown 
in Figure 3 and in SEQ ID NO: 1 encodes a protein of 509 amino acids. SEQ ID NO. 1 

20 contains the nucleotide sequence of the cDNA disclosed in Calvo and Vega (1993) J. Biol. 
Chem. 268:18929, and contains in addition a complete 5' end. The amino acid sequence 
of the protein set forth in SEQ ID NO: 2 is identical to the Cla-I protein disclosed in Calvo 
and Vega (1993) J. Biol. Chem. 268:18929. As set forth in Calvo and Vega, supra, 
differential splicing of the human SR-BI gene also results in a short mRNA lacking 300 

25 nucleotides located 126 nucleotides downstream of the initiation codon, i.e., lacking exons 
2 and 3 set forth in Figure 3, which encodes a protein of 409 amino acids. The shorter 
protein is referred to herein as "splice variant". The nucleotide sequence of a full length 
cDNA encoding the splice variant is set forth in SEQ ID NO: 3 and the amino acid 
sequence of the SR-BI splice variant protein encoded by this nucleotide sequence is set 

30 forth in SEQ ID NO: 4. The splice variant is rare relative to the 509 amino acid SR-BI 
protein. 
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Figure 2 shows the nucleotide sequence of the 3' end of the SR-BI promoter. 
Additional promoter sequence is disclosed in U.S. Patent Application 08/812,204 by Acton, 
incorporated herein by reference. 

Set forth below in Table I are the locations and sizes of the exons in the human SR- 

5 BI gene relative to the nucleotide sequence of a full length cDNA encoding human SR-BI 
protein (SEQ ID NO: 1), in which nucleotide 1 corresponds to the first nucleotide in the 
isolated transcript. Table I also indicates the portions of the human SR-BI protein encoded 
by each of these exons. Amino acid 1 is the initiating methionine. Also indicated is the 
length of the intron located downstream of each of the exons. 

10 Table I 



cDNA Nucleotide position 


Amino acid position 


Size of intron 


Exon 1 


1-244 


1-42 


intron 1: >2827 


Exon 2 


245-402 


43-95 


intron 2: 2429 


Exon 3 


403-544 


95-142 


intron 3: 567 


Exon 4 


545-748 


143-210 


intron 4: 2229 


Exon 5 


749-844 


211-242 


intron 5: 1580 


Exon 6 


845-960 


243-281 


intron 6: >10532 


Exon 7 


961-1127 


281-337 


intron 7: >3985 


Exon 8 


1228-1246 


337-376 


intron 8: >1 1321 


Exon 9 


1247-1320 


377-401 


intron 9: 7562 


Exon 10 


1321-1372 


401-418 


intron 10: 902 


Exon 1 1 


1373-1519 


419-467 


intron 11: 3547 


Exon 12 


1520-1648 


468-509 


intron 12: >4578 


Exon 13 


1649-2630 







25 

Figure 2 shows the nucleotide sequence of portions of the introns which are adjacent 
to the exons. The nucleotide sequence of each of the exons and adjacent portions of introns 
shown in Figure 2 are set forth in SEQ ID Nos. 5 to 16. The portions of each of the introns 
shown in Figure 2 are set forth in SEQ ID Nos. 1 8 to 40. For convenience, the identity of 
30 the sequences referred to as SEQ ID Nos. 1 to 40 are set forth below in Table II: 



Table II 
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SEQ ID NO: 1 
SEQ ID NO: 2 
SEQ ID NO: 3 
and 

5 SEQ ID NO: 4 

SEQ ID NO: 5 
SEQ ID NO: 6 
SEQ ID NO: 7 

10 SEQ ID NO: 8 
SEQ ID NO: 9 
SEQ ID NO: 10 
SEQ ID NO: 11 
SEQ ID NO: 12 

15 SEQ ID NO: 13 
SEQ ID NO: 14 
SEQ ID NO: 15 
SEQ ID NO: 16 
SEQ ID NO: 17 

20 SEQ ID NO: 18 
SEQ ID NO: 19 
SEQ ID NO: 20 
SEQ ID NO: 21 
SEQ ID NO: 22 

25 SEQ ID NO: 23 
SEQ ID NO: 24 
SEQ ID NO: 25 
SEQ ID NO: 26 
SEQ ID NO: 27 

30 SEQ ID NO: 28 
SEQ ID NO: 29 
SEQ ID NO: 30 



full length cDNA encoding human SR-BI; 

full length amino acid sequence of human SR-BI protein; 

full length cDNA encoding splice variant of human SR-BI (Calvo 

Vega, supra); 

full length amino acid sequence of splice variant of human SR-BI 
protein (Calvo and Vega, supra); 
3' end of promoter, exon 1, and 5' end of intron 1; 
3' end of intron 1, exon 2, and 5' end of intron 2; 
3' end of intron 2, exon 3, and 5' end of intron 3; 
3' end of intron 3, exon 4, and 5* end of intron 4; 
3' end of intron 4, exon 5, and 5' end of intron 5; 
3' end of intron 5, exon 6, and 5' end of intron 6; 
3' end of intron 6, exon 7, and 5' end of intron 7; 
3 1 end of intron 7, exon 8, and 5' end of intron 8; 
3' end of intron 8, exon 9, and 5' end of intron 9; 
3' end of intron 9, exon 10, and 5* end of intron 10; 
3' end of intron 10, exon 1 1 , and 5' end of intron 1 1 ; 
3' end of intron 1 1, exon 12, and 5' end of intron 12; 
3' end of promoter; 
5' end of intron 1; 
3' end of intron 1 
5' end of intron 2 
3' end of intron 2 
5' end of intron 3 
3' end of intron 3 
5' end of intron 4 
3' end of intron 4 
5' end of intron 5 
3' end of intron 5 
5 f end of intron 6 
3' end of intron 6 
5' end of intron 7 
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SEQ ID NO: 3 1 3' end of intron 7; 

SEQ ID NO: 32 5' end of intron 8; 

SEQ ID NO: 33 3' end of intron 8; 

SEQ ID NO: 34 5* end of intron 9; 

5 SEQ ID NO: 35 3' end of intron 9; 

SEQ ID NO: 36 5* end of intron 10 

SEQ ID NO: 37 3' end of intron 10 

SEQ ID NO: 38 5' end of intron 1 1 

SEQ ID NO: 39 3' end of intron 1 1 

10 SEQ ID NO: 40 5' end of intron 12. 



and 



An analysis of the human SR-BI gene in a population of individuals chosen because 
these individuals had a known age, known HDL and LDL levels, known body mass index, 
and known triglycerides and total cholesterol levels revealed the existence of several 

1 5 polymorphisms in the SR-BI gene in this population. These polymorphisms were identified 
by performing single stranded conformation polymorphism (SSCP) analysis of genomic 
DNA from independent individuals as described in Example 3, using PCR primers 
complementary to intronic or promoter sequences surrounding each of the exons. The 
nucleotide sequence of these PCR primers (having SEQ ID Nos. 41-64) is shown in Table 

20 IV (in the Examples). 

The results indicated the presence of at least five polymorphic regions in the human 
SR-BI gene in the population studied. The location and identity of these polymorphisms 
is indicated in Table III. 



Table III 

Locations of polymorphisms in the human SR-BI gene. 

cDNA amino acid 

position position change location 



poly- 
morphism 

exon 14 2 Cly -> Ser 

exon 3 403 135 Val -> lie 

intron 5 na na na 



1 2 3 
ATG (G/A)GC TGC 

135 136 137 
(G/A)TC ATG CCC 

240 241 242 

CTG AGC AAG gtgaggggcgagaggcgagggcccctgt 
cgccagggaggggagggtgggcc(c/t)g 
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cxon 8 1050 350 none 350 351 352 

(G/A)TC ATG CCC 

intron 10 na na na 419 

c(c/g)tgcggccccagctcatgtgtttgtcattctgtctcctcag AGC 

420 421 
GGG GCC 

The intron is defined as being after its corresponding exon (intron I is 3' of exon 1 ). cDNA position 1= the first base 
of the initiator methionine, the numbers above the sequences refer to the amino acid number, na = not applicable, lower 
case indicates intronic sequence, and the polymorphisms are in parentheses. 

As can be seen in Table III, one polymorphism is a change from a guanine to an 
adenine at position 146 in exon 1, which results in a change from a glycine to a serine at 
amino acid residue 2 of the encoded protein. The nucleotide sequence of this allele is set 
forth in SEQ ID NO: 95 (which is identical to SEQ ID NO: 5, except for nucleotide 146 of 
exon 1 which is an adenine). 

A second polymorphism is a change from a guanine to an adenine at position 119 
in exon 3, which results in a change from a valine to an isoleucine at amino acid residue 
135 of the encoded protein. The nucleotide sequence of this allele is set forth in SEQ ID 
NO: 96 (which is identical to SEQ ID NO: 7, except for nucleotide 1 19 of exon 3 which is 
an adenine). 

A third polymorphism is a change from a cytidine to a thymidine at position 41 of 
exon 8, which does not result in a difference in the amino acid sequence of the encoded 
protein. The nucleotide sequence of exon 8 of this allele is set forth in SEQ ID NO: 65 
(SEQ ID NO: 65 is identical to SEQ ID NO: 12, except for nucleotide 41 of the exon 
sequence which is a thymidine). About 35 % of the individuals of a Spanish population of 
142 individuals were found to be homozygous for the allele having a cytidine at position 
41 (i.e., SR-BI sequence originally disclosed); about 17% of the individuals were found to 
be homozygous for the allele having a thymidine at position 41 of exon 8; and about 48% 
of the individuals were found to be heterozygous, i.e., having one allele having a cytidine 
at position 41 and one allele having a thymidine at position 41. 

A fourth polymorphism is a change from a cytidine to a thymidine at position 56 
of intron 5 (position 1 being defined as the first nucleotide in the intron). This nucleotide 
substitution destroys the Apal restriction site which is present when the nucleotide at 
position 54 is a cytidine. The nucleotide sequence of the 5' end of intron 5 of this allele is 
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set forth in SEQ ID NO: 66 (SEQ ID NO: 66 is identical to SEQ ID NO: 26, except for 

nucleotide 54 which is a thymidine). 

A fifth polymorphism in the SR-BI gene is a change from a cytidine to a guanine 

at position -41 of intron 10 (position -1 corresponds to the first nucleotide upstream of exon 
5 11). The nucleotide sequence of the 3' end of intron 1 0 of this allele is set forth in SEQ ID 

NO: 97 (SEQ ID NO: 97 is identical to SEQ ID NO: 15, except for nucleotide -41 of intron 

1 0 which is a guanine). 

Further analysis of the human SR-BI gene is likely to reveal the existence of yet 

other polymorphic regions. Such analysis can be performed using the methods described 
10 herein and genomic DNA from random subjects or subjects of families associated with 

specific diseases. For example, the polymorphism studies described herein can also be 

applied to populations in which cholesterol gallstones are prevalent. 

Accordingly, the invention provides nucleic acids, e.g, intronic sequences useful as 

probes or primers for determining the identity of an allelic variant of an SR-BI polymorphic 
15 region. The invention also provides methods for determining the identity of the alleles of 

a specific polymorphic region of an SR-BI gene. Such methods can be used, for example, 

to determine whether a subject has or is at risk of developing a disease or condition 

associated with one or more specific alleles of polymorphic regions of an SR-BI gene. In 

a preferred embodiment, the disease or condition is caused or contributed to by an aberrant 
20 SR-BI bioactivity. Other aspects of the invention are described below or will be apparent 

to one of skill in the art in light of the present disclosure. 

4.2 Definitions 

For convenience, the meaning of certain terms and phrases employed in the 
25 specification, examples, and appended claims are provided below. 

The term "allele", which is used interchangeably herein with "allelic variant" refers 
to alternative forms of a gene or portions thereof. Alleles occupy the same locus or position 
on homologous chromosomes. When a subject has two identical alleles of a gene, the 
subject is said to be homozygous for the gene or allele. When a subject has two different 
30 alleles of a gene, the subject is said to be heterozygous for the gene. Alleles of a specific 
gene can differ from each other in a single nucleotide, or several nucleotides, and can 
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inciude substitutions, deletions, and insertions of nucleotides. An allele of a gene can also 
be a form of a gene containing a mutation. 

The term "allelic variant of a polymorphic region of an SR-BI gene" refers to a 
region of an SR-BI gene having one of several nucleotide sequences found in that region 
5 of the gene in other individuals. 

"Biological activity" or "bioactivity" or "activity" or "biological function", which 
are used interchangeably, for the purposes herein when applied to SR-BI means an effector 
or antigenic function that is directly or indirectly performed by an SR-BI polypeptide 
(whether in its native or denatured conformation), or by any subsequence (fragment) 

10 thereof Biological activities include binding to a ligand, e.g., a lipid or lipoprotein, such 
as LDL or modified forms thereof, or HDL or modified forms thereof. Other molecules 
which can bind an SR-BI receptor include anionic molecules, such as anionic 
phospholipids, negatively charged liposomes, and apoptotic cells. Another biological 
activity of an SR-BI protein includes endocytosis of a ligand interacting with the receptor. 

15 A biological activity is also intended to include binding to a protein, such as binding to the 
cytoplasmic domain of SR-BI. Yet other biological activities include signal transduction 
from the receptor, modulation of expression of genes responsive to binding of a ligand to 
an SR-BI receptor, and other biological activities, whether presently known or inherent. 
An SR-BI bioactivity can be modulated by directly affecting an SR-BI protein. 

20 Alternatively, an SR-BI bioactivity can be modulated by modulating the level of an SR-BI 
protein, such as by modulating expression of an SR-BI gene. Antigenic functions include 
possession of an epitope or antigenic site that is capable of cross-reacting with antibodies 
raised against a naturally occurring or denatured SR-BI polypeptide or fragment thereof. 
Biologically active SR-BI polypeptides include polypeptides having both an effector 

25 and antigenic function, or only one of such functions. SR-BI polypeptides include 
antagonist polypeptides and native SR-BI polypeptides, provided that such antagonists 
include an epitope of a native SR-BI polypeptide. An effector function of SR-BI 
polypeptide can be the ability to bind to a ligand, e.g., a lipid or modified form thereof 
As used herein the term "bioactive fragment of a SR-BI protein" refers to a fragment 

30 of a full-length SR-BI protein, wherein the fragment specifically mimics or antagonizes the 
activity of a wild-type SR-BI protein. The bioactive fragment preferably is a fragment 
capable of binding to a second molecule, such as a ligand. 
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The term "an aberrant activity" or "abnormal activity", as applied to an activity of 
a protein such as SR-BI, refers to an activity which differs from the activity of the wild-type 
or native protein or which differs from the activity of the protein in a healthy subject, e.g., 
a subject not afflicted with a disease associated with a specific allelic variant of an SR-BI 
5 polymorphism. An activity of a protein can be aberrant because it is stronger than the 
activity of its native counterpart. Alternatively, an activity can be aberrant because it is 
weaker or absent related to the activity of its native counterpart. An aberrant activity can 
also be a change in an activity. For example an aberrant protein can interact with a different 
protein relative to its native counterpart. A cell can have an aberrant SR-BI activity due to 

1 0 overexpression or underexpression of the gene encoding SR-BI. An aberrant SR-BI activity 
can result, e.g., from a mutation in the gene, which results, e.g., in lower or higher binding 
affinity of a lipid to the SR-BI protein encoded by the mutated gene. An aberrant SR-BI 
activity can also result from a lower or higher level of SR-BI receptor on cells, which can 
result, e.g., from a mutation in the 5' flanking region of the SR-BI gene or any other 

15 regulatory element of the SR-BI gene, such as a regulatory element located in an intron. 
Accordingly, an aberrant SR-BI activity can result from an abnormal SR-BI promoter 
activity. 

The terms "abnormal SR-BI promoter activity", "aberrant SR-BI promoter activity", 
"abnormal SR-BI transcriptional activity" and "aberrant SR-BI transcriptional activity", 

20 which are used interchangeably herein, refer to the transcriptional activity of an SR-BI 
promoter which differs from the transcriptional activity of the same promoter in a healthy 
subject. An abnormal SR-BI activity can result from a higher or lower transcriptional 
activity than that in a healthy subject. An aberrant SR-BI promoter activity can result, e.g., 
from the presence of a genetic lesion in a regulatory element, such as in a regulatory 

25 element located in the promoter. An "aberrant SR-BI promoter activity" is also intended 
to refer to the transcriptional activity of an SR-BI promoter which is functional (capable 
of inducing transcription of a gene to which it is operably linked) in tissues or cells in 
which the "natural" or wild-type SR-BI promoter is not functional or which is non 
functional in tissues or cells in which the "natural" or wild-type SR-BI promoter is 

30 functional. Thus, a tissue distribution of SR-BI in a subject which differs from the tissue 
distribution of SR-BI in a "normal" or "healthy" subject, can be the result of an abnormal 
transcriptional activity from the SR-BI promoter region. Such an abnormal transcriptional 
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activity can result, e.g., from one or more mutations in a promoter region, such as in a 
regulatory element thereof. An abnormal transcriptional activity can also result from a 
mutation in a transcription factor involved in the control of SR-BI gene expression. 

The term "body mass index" or "BMI" refers to the ratio of weight (kg)/height (m 2 ) 

5 and can be used to define whether a subject is overweight. Typically, a subject is 
underweight if he has a BMK20; normal if he has a BMI of 20-25, overweight if he has a 
BMI of 25-30, obese if he has a BMI of 30-40 and severely obese if he has a BMI>40. 

As used herein, a subject has an "abnormal body mass" or "abnormal body mass 
index" or "aberrant body mass" or "aberrant body mass index" if his body mass index is 

10 outside the range defined for a healthy or normal subject, i.e., BMI of 20-25. A disorder 
of body mass include any disorder affecting the body mass of a subject such that his body 
mass is outside the normal range. For example, obesity is a disorder of body mass. 
Wasting is also a disorder of body mass. An abnormal body mass index can have a 
hormonal origin, e.g., in premenopausal women. 

15 The term "cardiovascular disorder" refers to a disease or disorder of the 

cardiovascular system and includes ischemia, restenosis, congestive heart failure, and 
atherosclerosis. 

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably 
herein. It is understood that such terms refer not only to the particular subject cell but to 
20 the progeny or potential progeny of such a cell. Because certain modifications may occur 
in succeeding generations due to either mutation or environmental influences, such progeny 
may not, in fact, be identical to the parent cell, but are still included within the scope of the 
term as used herein. 

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid 
25 molecule comprising an open reading frame and including at least one exon and 
(optionally) an intron sequence. The term "intron" refers to a DNA sequence present in a 
given gene which is spliced out during mRNA maturation. 

"Homology" or "identity" or "similarity" refers to sequence similarity between two 
peptides or between two nucleic acid molecules. Homology can be determined by 
30 comparing a position in each sequence which may be aligned for purposes of comparison. 
When a position in the compared sequence is occupied by the same base or amino acid, 
then the molecules are homologous at that position. A degree of homology between 
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sequences is a function of the number of matching or homologous positions shared by the 
sequences. An "unrelated" or "non-homologous" sequence shares less than 40 % identity, 
though preferably less than 25 % identity, with one of the sequences of the present 
invention. 

5 The term " a homolog of a nucleic acid" refers to a nucleic acid having a nucleotide 

sequence having a certain degree of homology with the nucleotide sequence of the nucleic 
acid or complement thereof. A homolog of a double stranded nucleic acid having SEQ ID 
NO: x is intended to include nucleic acids having a nucleotide sequence which has a certain 
degree of homology with SEQ ID NO: x or with the complement thereof. Preferred 

10 homologs of nucleic acids are capable of hybridizing to the nucleic acid or complement 
thereof. 

The term "interact" as used herein is meant to include detectable interactions 
between molecules, such as can be detected using, for example, a hybridization assay. The 
term interact is also meant to include "binding" interactions between molecules. 

15 Interactions may be, for example, protein-protein, protein-nucleic acid, protein-small 
molecule or small molecule-nucleic acid in nature. 

The term "intronic sequence" or "intronic nucleotide sequence" refers to the 
nucleotide sequence of an intron or portion thereof . 

The term "isolated" as used herein with respect to nucleic acids, such as DNA or 

20 RNA, refers to molecules separated from other DNAs or RNAs, respectively, that are 
present in the natural source of the macromolecule. The term isolated as used herein also 
refers to a nucleic acid or peptide that is substantially free of cellular material, viral 
material, or culture medium when produced by recombinant DNA techniques, or chemical 
precursors or other chemicals when chemically synthesized. Moreover, an "isolated nucleic 

25 acid" is meant to include nucleic acid fragments which are not naturally occurring as 
fragments and would not be found in the natural state. The term "isolated" is also used 
herein to refer to polypeptides which are isolated from other cellular proteins and is meant 
to encompass both purified and recombinant polypeptides. 

The term "lipid" shall refer to a fat or fat-like substance that is insoluble in polar 

30 solvents such as water. The term "lipid" is intended to include true fats (e.g. esters of fatty 
acids and glycerol); lipids (phospholipids, cerebrosides, waxes); sterols (cholesterol, 
ergosterol) and lipoproteins (e.g. HDL, LDL and VLDL). 
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The term "locus" refers to a specific position in a chromosome. For example, a 
locus of an SR-BI gene refers to the chromosomal position of the SR-BI gene. 

The term "modulation" as used herein refers to both upregulation, (i.e., activation 
or stimulation), for example by agonizing; and downregulation (i.e. inhibition or 
5 suppression), for example by antagonizing of a bioactivity (e.g. expression of a gene). 

The term "molecular structure" of a gene or a portion thereof refers to the structure 
as defined by the nucleotide content (including deletions, substitutions, additions of one or 
more nucleotides), the nucleotide sequence, the state of methylation, and/or any other 
modification of the gene or portion thereof. 
10 The term "mutated gene" refers to an allelic form of a gene, which is capable of 

altering the phenotype of a subject having the mutated gene relative to a subject which does 
not have the mutated gene. If a subject must be homozygous for this mutation to have an 
altered phenotype, the mutation is said to be recessive. If one copy of the mutated gene is 
sufficient to alter the genotype of the subject, the mutation is said to be dominant. If a 
1 5 subject has one copy of the mutated gene and has a phenotype that is intermediate between 
that of a homozygous and that of a heterozygous (for that gene) subject, the mutation is said 
to be co-dominant. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term 

20 should also be understood to include, as equivalents, derivatives, variants and analogs of 
either RNA or DNA made from nucleotide analogs, and, as applicable to the embodiment 
being described, single (sense or antisense) and double-stranded polynucleotides. 
Deoxyribonucleotides include deoxyadenosine, deoxycytidine, deoxyguanosine, and 
deoxythymidine. For purposes of clarity, when referring herein to a nucleotide of a nucleic 

25 acid, which can be DNA or an RNA, the terms "adenosine", "cytidine", "guanosine", and 
thymidine" are used. It is understood that if the nucleic acid is RNA, a nucleotide having 
a uracil base is uridine. 

The term "nucleotide sequence complementary to the nucleotide sequence set forth 
in SEQ ID NO: x" refers to the nucleotide sequence of the complementary strand of a 

30 nucleic acid strand having SEQ ID NO: x. The term "complementary strand" is used herein 
interchangeably with the term "complement". The complement of a nucleic acid strand can 
be the complement of a coding strand or the complement of a non-coding strand. When 
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referring to double stranded nucleic acids, the complement of a nucleic acid having SEQ 
ID NO: x refers to the complementary strand of the strand having SEQ ID NO: x or to any 
nucleic acid having the nucleotide sequence of the complementary strand of SEQ ID NO: 
x. When referring to a single stranded nucleic acid having the nucleotide sequence SEQ 

5 ID NO: x, the complement of this nucleic acid is a nucleic acid having a nucleotide 
sequence which is complementary to that of SEQ ID NO: x. The nucleotide sequences and 
complementary sequences thereof are always given in the 5' to 3' direction. The term 
"complement" and "reverse complement" are used interchangeably herein. 

A "non-human animal" of the invention can include mammals such as rodents, 

10 non-human primates, sheep, goats, horses, dogs, cows, chickens, amphibians, reptiles, etc. 
Preferred non-human animals are selected from the rodent family including rat and mouse, 
most preferably mouse, though transgenic amphibians, such as members of the Xenopus 
genus, and transgenic chickens can also provide important tools for understanding and 
identifying agents which can affect, for example, embryogenesis and tissue formation. The 

15 term "chimeric animal" is used herein to refer to animals in which an exogenous sequence 
is found, or in which an exogenous sequence is expressed in some but not all cells of the 
animal. The term "tissue-specific chimeric animal" indicates that an exogenous sequence 
is present and/or expressed or disrupted in some tissues, but not others. 

The term "operably linked" is intended to mean that the promoter is associated with 

20 the nucleic acid in such a manner as to facilitate transcription of the nucleic acid from the 
promoter. 

The term "polymorphism" refers to the coexistence of more than one form of a gene 
or portion thereof. A portion of a gene of which there are at least two different forms, i.e., 
two different nucleotide sequences, is referred to as a "polymorphic region of a gene". A 
25 polymorphic region can be a single nucleotide, the identity of which differs in different 
alleles. A polymorphic region can also be several nucleotides long. 

A "polymorphic gene" refers to a gene having at least one polymorphic region. 
The terms "protein", "polypeptide" and "peptide" are used interchangeably herein 
when referring to a gene product. 
30 The term "recombinant protein" refers to a polypeptide which is produced by 

recombinant DNA techniques, wherein generally, DNA encoding the polypeptide is 
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inserted into a suitable expression vector which is in turn used to transform a host cell to 
produce the heterologous protein. 

A "regulatory element", also termed herein "regulatory sequence is intended to 
include elements which are capable of modulating transcription from a basic promoter and 

5 include elements such as enhancers and silencers. The term "enhancer", also referred to 
herein as "enhancer element", is intended to include regulatory elements capable of 
increasing, stimulating, or enhancing transcription from a basic promoter. The term 
"silencer", also referred to herein as "silencer element" is intended to include regulatory 
elements capable of decreasing, inhibiting, or repressing transcription from a basic 

10 promoter. Regulatory elements are typically present in 5' flanking regions of genes. 
However, regulatory elements have also been shown to be present in other regions of a 
gene, in particular in introns. Thus, it is possible that SR-BI genes have regulatory 
elements located in introns, exons, coding regions, and 3' flanking sequences. Such 
regulatory elements are also intended to be encompassed by the present invention and can 

15 be identified by any of the assays that can be used to identify regulatory elements in 5' 
flanking regions of genes. 

The term "regulatory element" further encompasses "tissue specific" regulatory 
elements, i.e., regulatory elements which effect expression of the selected DNA sequence 
preferentially in specific cells (e.g., cells of a specific tissue). Gene expression occurs 

20 preferentially in a specific cell if expression in this cell type is significantly higher than 
expression in other cell types. The term "regulatory element" also encompasses non-tissue 
specific regulatory elements, i.e., regulatory elements which are active in most cell types. 
Furthermore, a regulatory element can be a constitutive regulatory element, i.e., a 
regulatory element which constitutively regulates transcription, as opposed to a regulatory 

25 element which is inducible, i.e., a regulatory element which is active primarily in response 
to a stimulus. A stimulus can be, e.g., a molecule, such as a hormone, cytokine, heavy 
metal, phorbol ester, cyclic AMP (cAMP), or retinoic acid. 

Regulatory elements are typically bound by proteins, e.g., transcription factors. 
The term "transcription factor" is intended to include proteins or modified forms thereof, 

30 which interact preferentially with specific nucleic acid sequences, i.e., regulatory elements, 
and which in appropriate conditions stimulate or repress transcription. Some transcription 
factors are active when they are in the form of a monomer. Alternatively, other 
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transcription factors are active in the form of a dirtier consisting of two identical proteins 
or different proteins (heterodimer). Modified forms of transcription factors are intended 
to refer to transcription factors having a postranslational modification, such as the 
attachment of a phosphate group. The activity of a transcription factor is frequently 

5 modulated by a postranslational modification. For example, certain transcription factors 
are active only if they are phosphorylated on specific residues. Alternatively, transcription 
factors can be active in the absence of phosphorylated residues and become inactivated by 
phosphorylation. A list of known transcription factors and their DNA binding site can be 
found, e.g., in public databases, e.g., TFMATRIX Transcription Factor Binding Site Profile 

1 0 database. 

As used herein, the term "specifically hybridizes" or "specifically detects" refers to 
the ability of a nucleic acid molecule of the invention to hybridize to at least approximately 
6, 12, 20, 30, 40, 50, 60, 70, 80, 90, 100, 1 10, 120, 130 or 140 consecutive nucleotides of 
either strand of an SR-BI gene. 

1 5 "SR-BI" or "SR-BI receptor" refers to a class B scavenger receptor that has been 

shown to bind HDL cholesterol and mediate uptake into cells (Acton, S. et al., Science 
277:518-520). SR-BI has also been shown to bind with high affinity to modified proteins 
(e.g. acetylated LDL, oxidized LDL, maleylated bovine serum albumin) and native LDL 
(Acton, et al., (1994) J. Biochem, 269:21003-21009). Further, SR-BI has been shown to 

20 bind anionic phospholipids, such as phosphatidylserine and phosphatidylinositol, but not 
zwitterionic phospholipids, such as phosphatidylcholine, phosphatidylethanolamine and 
sphingomyelin. Competition studies suggest that anionic phospholipids bind to SR-BI at 
a site close to or identical with the sites of native and modified LDL binding and that the 
interaction may involve polyvalent binding via multiple anionic phospholipid molecules 

25 (Rigotti, A.., S. Acton and M. Krieger (1995) 1 Biochem 270:16221-16224). SR-BI has 
also been shown to bind to negatively charged liposomes and apoptotic cells. The human 
SR-BI protein is described in Calvo et al. (1993) J. Biol. Chem. 268:1 8929 and hamster SR- 
BI is described in International Patent Application Number WO 96/00288 entitled "Class 
Bl and CI Scavenger Receptors" by Acton, S. et al. 

30 The term "SR-BI therapeutic" refers to various forms of SR-BI polypeptides, as 

well as peptidomimetics, nucleic acids, or small molecules, which can modulate at least one 
activity of an SR-BI by mimicking or potentiating (agonizing) or inhibiting (antagonizing) 
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the effects of a naturally-occurring SR-BI polypeptide. An SR-BI therapeutic which 
mimics or potentiates the activity of a wild-type SR-BI polypeptide is a "SR-BI agonist". 
Conversely, an SR-BI therapeutic which inhibits the activity of a wild-type SR-BI 
polypeptide is a "SR-BI antagonist". SR-BI therapeutics can be used to treat diseases 

5 which are associated with a specific SR-BI allele which encodes a protein having an amino 
acid sequence that differs from that of the wild-type SR-BI protein. 

As used herein, the term "transfection" means the introduction of a nucleic acid, 
e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. The 
term "transduction" is generally used herein when the transfection with a nucleic acid is by 

1 0 viral delivery of the nucleic acid. "Transformation", as used herein, refers to a process in 
which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or 
RNA, and, for example, the transformed cell expresses a recombinant form of a polypeptide 
or, in the case of anti-sense expression from the transferred gene, the expression of a 
naturally-occurring form of the recombinant protein is disrupted. 

15 As used herein, the term "transgene" refers to a nucleic acid sequence which has 

been introduced into a cell. Daughter cells deriving from a cell in which a transgene has 
been introduced are also said to contain the transgene (unless it has been deleted). A 
transgene can encode, e.g., a polypeptide, or an antisense transcript, partly or entirely 
heterologous, i.e., foreign, to the transgenic animal or cell into which it is introduced, or, 

20 is homologous to an endogenous gene of the transgenic animal or cell into which it is 
introduced, but which is designed to be inserted, or is inserted, into the animal's genome 
in such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted 
at a location which differs from that of the natural gene or its insertion results in a 
knockout). Alternatively, a transgene can also be present in an episome. A transgene can 

25 include one or more transcriptional regulatory sequence and any other nucleic acid, (e.g. 
intron), that may be necessary for optimal expression of a selected nucleic acid. 

A "transgenic animal" refers to any animal, preferably a non-human animal, e.g. a 
mammal, bird or an amphibian, in which one or more of the cells of the animal contain 
heterologous nucleic acid introduced by way of human intervention, such as by transgenic 

30 techniques well known in the art. The nucleic acid is introduced into the cell, directly or 
indirectly by introduction into a precursor of the cell, by way of deliberate genetic 
manipulation, such as by microinjection or by infection with a recombinant virus. The term 
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genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but 
rather is directed to the introduction of a recombinant DNA molecule. This molecule may 
be integrated within a chromosome, or it may be extrachromosomally replicating DNA. 
In the typical transgenic animals described herein, the transgene causes cells to express a 

5 recombinant form of one of a protein, e.g. either agonistic or antagonistic forms. However, 
transgenic animals in which the recombinant gene is silent are also contemplated, as for 
example, the FLP or CRE recombinase dependent constructs described below. Moreover, 
"transgenic animal" also includes those recombinant animals in which gene disruption of 
one or more genes is caused by human intervention, including both recombination and 

1 0 antisense techniques. 

The term "treating" as used herein is intended to encompass curing as well as 
ameliorating at least one symptom of the condition or disease. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of preferred vector 

15 is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred 
vectors are those capable of autonomous replication and/or expression of nucleic acids to 
which they are linked. Vectors capable of directing the expression of genes to which they 
are operatively linked are referred to herein as "expression vectors". In general, expression 
vectors of utility in recombinant DNA techniques are often in the form of "plasmids" which 

20 refer generally to circular double stranded DNA loops which, in their vector form are not 
bound to the chromosome. In the present specification, "plasmid" and "vector" are used 
interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors which serve 
equivalent functions and which become known in the art subsequently hereto. 

25 The term "wild-type allele" refers to an allele of a gene which, when present in two 

copies in a subject results in a wild-type phenotype. There can be several different wild- 
type alleles of a specific gene, since certain nucleotide changes in a gene may not affect the 
phenotype of a subject having two copies of the gene with the nucleotide changes. 

30 Nucleic Acids of the Pres ent Invention 

As described below, one aspect of the invention pertains to isolated nucleic acids 
comprising an intronic sequence of an SR-BI gene. In a preferred embodiment, the 
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invention provides an intronic sequence of the genomic DNA sequence encoding an SR-BI 
protein, comprising an intronic sequence shown in Figure 2 or set forth in any of SEQ ID 
NOs. 1-121 or complements thereof or homologs thereof. Other preferred nucleic acids of 
the invention include specific SR-BI alleles, which differ from the allelic variant having the 

5 nucleotide sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 3, or at least a portion 
thereof having a polymorphic region. Nucleic acids of the invention can function as probes 
or primers, e.g., in methods for determining the identity of an allelic variant of an SR-BI 
polymorphic region. The nucleic acids of the invention can also be used to determine 
whether a subject is at risk of developing a disease associated with a specific allelic variant 

10 of an SR-BI polymorphic region, e.g, a disease or disorder associated with an aberrant SR- 
BI activity. The nucleic acids of the invention can further be used to prepare SR-BI 
polypeptides encoded by specific alleles, such as mutant alleles. Such polypeptides can be 
used in gene therapy. Polypeptides encoded by specific SR-BI alleles, such as mutant SR- 
BI polypeptides, can also be used for preparing reagents, e.g., antibodies, for detecting SR- 

15 BI proteins encoded by these alleles. Accordingly, such reagents can be used to detect 
mutant SR-BI proteins. 

Certain nucleic acids of the invention comprise an intronic sequence of an SR-BI 
gene. The term "SR-BI intronic sequence" refers to a nucleotide sequence of an intron of 
an SR-BI gene. An intronic sequence can be directly adjacent to an exon or located further 

20 away from the exons. Preferred nucleic acids of the invention include an intronic sequence 
of an SR-BI gene which is adjacent to an exon and comprises at least about 3 consecutive 
nucleotides, at least about 6 consecutive nucleotides, at least about 9 consecutive 
nucleotides, at least about 12 consecutive nucleotides, at least about 15 consecutive 
nucleotides, at least about 1 8 consecutive nucleotides, or at least about 20 consecutive 

25 nucleotides. Isolated nucleic acids which comprise an SR-BI intronic sequence which is 
immediately adjacent to an exon and comprises at least about 25 consecutive nucleotides, 
at least about 30 consecutive nucleotides, at least about 35 consecutive nucleotides, at least 
about 40 consecutive nucleotides, at least about 50 consecutive nucleotides, or at least 
about 100 consecutive nucleotides are also within the scope of the invention. Preferred 

30 isolated nucleic acids of the invention also include those having an SR-BI intronic sequence 
having a nucleotide sequence of at least about 10 nucleotides, at least about 15 nucleotides, 
at least about 20 nucleotides, at least about 25 nucleotides, at least about 30 nucleotides, at 



WO 99/02735 



PCT/US98/14354 



-27- 

least about 35 nucleotides, at least about 40 nucleotides, at least about 50 nucleotides or at 
least about 100 nucleotides. Other preferred nucleic acids of the invention can comprise 
an SR-BI intronic sequence having less than about 10 nucleotides, provided that the 
nucleotide sequence is novel. Yet other preferred isolated nucleic acids of the invention 
5 include SR-BI intronic nucleic acid sequences of an SR-BI intron, having at least about 1 50 
consecutive nucleotides, at least about 200 consecutive nucleotides, at least about 250 
consecutive nucleotides, at least about 300 consecutive nucleotides, at least about 350 
consecutive nucleotides, at least about 400 consecutive nucleotides, at least about 500 
consecutive nucleotides or at least about 1000 consecutive nucleotides 

10 Preferred nucleic acids of the invention comprise an SR-BI intronic sequence 

having a nucleotide sequence shown in Figure 2, and/or in any of SEQ ID Nos. 1-121, 
complement thereof, reverse complement thereof or homolog thereof. In a preferred 
embodiment, the invention provides an isolated nucleic acid comprising an SR-BI intronic 
which is at least about 70% 75%, 80%, 85%, 90%, 95%, or preferably at least about 98%, 

1 5 and most preferably at least about 99% identical to an intronic nucleotide sequence shown 
in Figure 2 or set forth in any of SEQ ID NOS. 1 - 1 2 1 or a complement thereof. In fact, as 
described herein, several alleles of human SR-BI genes have been identified. The invention 
is intended to encompass all of these alleles and SR-BI alleles not yet identified, which can 
be identified, e.g, according to the methods described herein. 

20 The invention also provides isolated nucleic acids comprising at least one 

polymorphic region of an SR-BI gene having a nucleotide sequence which differs from the 
nucleotide sequence set forth in SEQ ID NO: 1 or SEQ ID NO: 3. Preferred nucleic acids 
have a polymorphic region located in an exon of an SR-BI gene, such as exons 1, 3 or 8. 
Accordingly, preferred nucleic acids of the invention comprise an adenine at position 146 

25 of exon 1 (as set forth in SEQ ID NO: 95), an adenine at position 1 19 of exon 3 (as set forth 
in SEQ ID NO: 96, and/or a thymidine at position 41 of exon 8 (as set forth in SEQ ID NO: 
65). Preferred nucleic acids can also have a polymorphic region in an intron, e.g., intron 
5 or 10. For example, the invention provides nucleic acids having a polymorphic 
nucleotide at position 54 of intron 5 and/or at position -41 of intron 10. In a preferred 

30 embodiment, the nucleic acid has a thymidine at position 54 of intron 5 (as set forth in SEQ 
ID NO: 66) and/or a guanine at position -41 of intron 10 (as set forth in SEQ ID NO: 97). 
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The nucleic acids can be genomic DNA, cDNA, or RNA (in which case, the nucleic acid 
has a uridine at position 54 of intron 5). 

Also within the scope of the invention are isolated nucleic acids which encode an 
SR-BI protein, such as an SR-BI protein having an amino acid sequence which differs from 
5 the amino acid sequence set forth in SEQ ID NOs 2 and 4. Preferred nucleic acids encode 
an SR-BI polypeptide comprising an amino acid sequence from SEQ ID NO: 2 or 4 in 
which residue 2 is a serine and/or in which residue 135 is an isoleucine. 

Preferred nucleic acids of the invention are from vertebrate genes encoding SR-BI 
proteins. Particularly preferred vertebrate nucleic acids are mammalian nucleic acids. A 

10 particularly preferred nucleic acid of the invention is a human nucleic acid, such as a 
nucleic acid comprising an SR-BI intronic sequence shown in Figure 2 or set forth in any 
of SEQ ID NOS. 1-121 or complement thereof or an allele comprising a nucleotide 
sequence set forth in SEQ ID NO: 65 or SEQ ID NO: 97. 

Another aspect of the invention provides a nucleic acid which hybridizes under 

15 appropriate stringency to an SR-BI intronic sequence having a nucleotide sequence shown 
in introns shown in Figure 2 or in intronic sequences set forth in any of SEQ ID Nos. 1-121 
or complement thereof. Appropriate stringency conditions which promote DNA 
hybridization, for example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by a wash of 2.0 x SSC at 50 °C, are known to those skilled in the art or can be 

20 found in Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. (1 989), 6.3. 1 - 
6.3.6. For example, the salt concentration in the wash step can be selected from a low 
stringency of about 2.0 x SSC at 50°C to a high stringency of about 0.2 x SSC at 50°C. In 
addition, the temperature in the wash step can be increased from low stringency conditions 
at room temperature, about 22°C, to high stringency conditions at about 65°C. Both 

25 temperature and salt may be varied, or temperature or salt concentration may be held 
constant while the other variable is changed. In a preferred embodiment, a nucleic acid of 
the present invention will bind to at least about 20, preferably at least about 25, more 
preferably at least about 30 and most preferably at least about 50 consecutive nucleotides 
of a sequence shown in Figure 2 or set forth in any of SEQ ID Nos. 1-1 21 under moderately 

30 stringent conditions, for example at about 2.0 x SSC and about 40°C. Even more preferred 
nucleic acids of the invention are capable of hybridizing under stringent conditions to an 
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intronic sequence of at least about 20, 30, 40, or at least about 50 nucleotides as shown in 
Figure 2 or as set forth in an intronic sequence of any of SEQ ID Nos.1-121. 

Hybridization, as described above, can be used to isolate nucleic acids comprising 
an SR-BI intron or portion thereof from various animal species. A comparison of these 

5 nucleic acids should be indicative of intronic sequences which may have a regulatory or 
other function, since these regions are expected to be conserved among various species. 
Hybridization can also be used to isolate SR-BI alleles. 

The nucleic acid of the invention can be single stranded DNA (e.g., an 
oligonucleotide), double stranded DNA (e.g., double stranded oligonucleotide) or RNA. 

10 Preferred nucleic acids of the invention can be used as probes or primers. Primers of the 
invention refer to nucleic acids which hybridize to a nucleic acid sequence which is adjacent 
to the region of interest or which covers the region of interest and is extended. A primer 
can be used alone in a detection method, or a primer can be used together with at least one 
other primer or probe in a detection method. Primers can also be used to amplify at least 

15 a portion of a nucleic acid. Probes of the invention refer to nucleic acids which hybridize 
to the region of interest and which are not further extended. For example, a probe is a 
nucleic acid which hybridizes to a polymorphic region of an SR-BI gene, and which by 
hybridization or absence of hybridization to the DNA of a subject will be indicative of the 
identity of the allelic variant of the polymorphic region of the SR-BI gene. 

20 Numerous procedures for determining the nucleotide sequence of a nucleic acid, or 

for determining the presence of mutations in nucleic acids include a nucleic acid 
amplification step, which can be carried out by, e.g., polymerase chain reaction (PCR). 
Accordingly, in one embodiment, the invention provides primers for amplifying portions 
of an SR-BI gene, such as portions of exons and/or portions of introns. In a preferred 

25 embodiment, the exons and/or sequences adjacent to the exons of the human SR-BI gene 
will be amplified to, e.g., detect which allelic variant of a polymorphic region is present in 
the SR-BI gene of a subject. Preferred primers comprise a nucleotide sequence 
complementary to an SR-BI intronic sequence or a specific allelic variant of an SR-BI 
polymorphic region and of sufficient length to selectively hybridize with an SR-BI gene. 

30 In a preferred embodiment, the primer, e.g., a substantially purified oligonucleotide, 
comprises a region having a nucleotide sequence which hybridizes under stringent 
conditions to about 6, 8, 10, or 12, preferably 25, 30, 40, 50, or 75 consecutive nucleotides 
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of an SR-BI gene. In an even more preferred embodiment, the primer is capable of 
hybridizing to an SR-BI intron and has a nucleotide sequence of an intronic sequence shown 
in Figure 2 or set forth in any of SEQ ID Nos. 1-121, complements thereof, allelic variants 
thereof, or complements of allelic variants thereof. For example, primers comprising a 
nucleotide sequence of at least about 15 consecutive nucleotides, at least about 20 
nucleotides or having from about 15 to about 25 nucleotides shown in Figure 2 or set forth 
in any of SEQ ID NOS. 1-121 or complement thereof are provided by the invention. 
Primers having a sequence of more than about 25 nucleotides are also within the scope of 
the invention. Preferred primers of the invention are primers that can be used in PCR for 
amplifying each of the exons of an SR-BI gene. Even more preferred primers of the 
invention have the nucleotide sequence set forth in any of SEQ ID Nos. 41 -64 and 89-94 
(see Table IV). 

Primers can be complementary to nucleotide sequences located close to each other 
or ftirther apart, depending on the use of the amplified DNA. For example, primers can be 
chosen such that they amplify DNA fragments of at least about 10 nucleotides or as much 
as several kilobases. Preferably, the primers of the invention will hybridize selectively to 
nucleotide sequences located about 150 to about 350 nucleotides apart. 

For amplifying at least a portion of a nucleic acid, a forward primer (i.e., 5* primer) 
and a reverse primer (i.e., 3' primer) will preferably be used. Forward and reverse primers 
hybridize to complementary strands of a double stranded nucleic acid, such that upon 
extension from each primer, a double stranded nucleic acid is amplified. A forward primer 
can be a primer having a nucleotide sequence or a portion of the nucleotide sequence shown 
in Figure 2 or in SEQ ID Nos. 1-40, 65, 66, and 95-97. A reverse primer can be a primer 
having a nucleotide sequence or a portion of the nucleotide sequence that is complementary 
to a nucleotide sequence shown in Figure 2 or in SEQ ID Nos. 1-40, 65, 66, and 95-97. 
Preferred forward primers comprise a nucleotide sequence set forth in SEQ ID Nos. 41, 43, 
45, 47, 49, 51, 53, 55, 57, 59, 61 , 63, and 85 (shown in Table IV). Preferred reverse primers 
comprise a nucleotide sequence 'set forth in SEQ ID Nos. 42, 44, 46, 48, 50, 52, 54, 56, 58, 
60, 62, 64, and 86. Preferred pairs of primers for amplifying each of the exons of human 
SR-BI are set forth in Table IV. 

Yet other preferred primers of the invention are nucleic acids which are capable of 
selectively hybridizing to an allelic variant of a polymorphic region of an SR-BI gene. 
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Thus, such primers can be specific for an SR-BI gene sequence, so long as they have a 
nucleotide sequence which is capable of hybridizing to an SR-BI gene. Preferred primers 
are capable of specifically hybridizing to an allelic variant in which nucleotide 146 of exon 
1 of human SR-BI is an adenine, e.g., a nucleic acid having SEQ ID NO: 95; an allelic 
5 variant in which nucleotide 1 1 9 of exon 3 is an adenine, e.g., a nucleic acid having SEQ ID 
NO: 96; or an allelic variant in which nucleotide 41 of exon 8 of human SR-BI is a 
thymidine, e.g., a nucleic acid having SEQ ID NO: 65. Other preferred primers are capable 
of specifically hybridizing to an allelic variant in which nucleotide 54 of intron 5 is a 
thymidine, e.g., a nucleic acid having SEQ ID NO: 66 or nucleotide -41 of intron 10 is a 

10 guanine, e.g., a nucleic acid having SEQ ID NO: 97. Such primers can be used, e.g., in 
sequence specific oligonucleotide priming as described further herein. 

The SR-BI nucleic acids of the invention can also be used as probes, e.g., in 
therapeutic and diagnostic assays. For instance, the present invention provides a probe 
comprising a substantially purified oligonucleotide, which oligonucleotide comprises a 

15 region having a nucleotide sequence that hybridizes under stringent conditions to at least 
approximately 6, 8, 10 or 12, preferably about 25, 30, 40, 50 or 75 consecutive nucleotides 
of an SR-BI gene. In one embodiment, the probes preferably hybridize to an intron of an 
SR-BI gene, having an intronic nucleotide sequence shown in Figure 2 or set forth in any 
of SEQ ID Nos. 1-121, allelic variants thereof, complements thereof or complements of 

20 allelic variants thereof. In another embodiment, the probes are capable of hybridizing to a 
nucleotide sequence encompassing an intron/exon border of an SR-BI gene. 

Other preferred probes of the invention are capable of hybridizing specifically to 
a region of an SR-BI gene which is polymorphic. In an even more preferred embodiment 
of the invention, the probes are capable of hybridizing specifically to one allelic variant of 

25 an SR-BI gene having a nucleotide sequence which differs from the nucleotide sequence 
set forth in SEQ ID NO: 1 or 3. Such probes can then be used to specifically detect which 
allelic variant of a polymorphic region of an SR-BI gene is present in a subject. The 
polymorphic region can be located in the promoter, exon, or intron sequences of an SR-BI 
gene. 

30 For example, preferred probes of the invention are capable of hybridizing 

specifically to a region overlapping nucleotide 146 of exon 1 of the human SR-BI gene. In 
one embodiment, the probe overlapping nucleotide 146 of exon 1 is capable of hybridizing 
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specifically to a nucleotide sequence wherein nucleotide 146 is an adenine (as shown in 
SEQ ID NO: 95). Examples of such probes include a probe having the nucleotide sequence 
5' GCGGAGCAGCTCATGTCTGCG 3' (SEQ ID NO: 98); 5' 
CTTTCGCGGAGCAGCTCATGTCTGCGCGCCT 3' (SEQ ID NO: 99); and probes having 
5 the complement of these nucleotide sequences, i.e., 5' CGCAGACATGAGCTGCTCCGC 
3' (SEQ ID NO: 100); 5* AGGCGCGCAGACATGAGCTGCTCCGCCAAAG 3 f (SEQ ID 
NO: 101). The bold nucleotides represents the location of the nucleotide polymorphism. 
In another embodiment, the probe overlapping nucleotide 146 of exon 1 is capable of 
specifically hybridizing to a nucleotide sequence wherein nucleotide 146 is a guanine (as 

10 shown in Figure 2 and set forth in SEQ ID NO: 5). Examples of such probes include a 
probe having the nucleotide sequence 5' GCGGAGCAGCGCATGTCTGCG 3' (SEQ ID 
NO: 102); CTTTCGCGGAGCAGCGCATGTCTGCGCGCCT 3' (SEQ ID NO: 103) and 
probes having the complement of these nucleotide sequences, i.e., 5' 
CGCAGACATGCGCTGCTCCGC 3' (SEQ ID NO: 104); 5' 

15 AGGCGCGCAGACATGCGCTGCTCCGCCAAAG 3' (SEQ ID NO: 105). 

Preferred probes of the invention are capable of hybridizing specifically to a region 
overlapping nucleotide 1 19 of exon 3 of the human SR-BI gene. In one embodiment, the 
probe overlapping nucleotide 119 of exon 3 is capable of hybridizing specifically to a 
nucleotide sequence wherein nucleotide 1 19 is an adenine (as shown in SEQ ID NO: 96). 

20 Examples of such probes include a probe having the nucleotide sequence 5' 
TTGGGCATGATGATGTAGACG 3' (SEQ ID NO: 106); 5' 
GGATGTTGGGCATGATGATGTAGACGCTCTC 3' (SEQ ID NO: 107); and probes 
having the complement of these nucleotide sequences, i.e., 5' 
CGACTACATCATCATGCCCAA 3' (SEQ ID NO: 108); 5' 

25 GAGAGCGACTACATCATCATGCCCAACATCC 3' (SEQ ID NO: 109). The bold 
nucleotides represents the location of the nucleotide polymorphism. In another 
embodiment, the probe overlapping nucleotide 119 of exon 3 is capable of specifically 
hybridizing to a nucleotide sequence wherein nucleotide 1 1 9 is a guanine (as shown in 
Figure 2 and set forth in SEQ ID NO: 7). Examples of such probes include a probe having 

30 the nucleotide sequence 5' TTGGGCATGAGGATGTAGACG 3' (SEQ ID NO: 110); 
GGATGTTGGGCATGAGGATGTAGACGCTCTC 3' (SEQ ID NO: 111) and probes 
having the complement of these nucleotide sequences, i.e., 5' 
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CGACTACATCCTCATGCCCAA 3' (SEQ ID NO: 112); 5' 
GAGAGCGACTACATCCATCATGCCCAACATCC 3' (SEQ ID NO: 113). 

Other preferred probes of the invention are capable of hybridizing specifically to 
a region overlapping nucleotide 41 of exon 8 of the human SR-B1 gene. In one 

5 embodiment, the probe overlapping nucleotide 41 of exon 8 is capable of hybridizing 
specifically to a nucleotide sequence wherein nucleotide 41 is a thymidine (as shown in 
SEQ ID NO: 65). Examples of such probes include a probe having the nucleotide sequence 
5' AACCGGGTCAGCGTTGAGGA 3' (SEQ ID NO: 67); 5' 
TGCCAGAACCGGGTCAGCGTTGAGGAAGTGA 3' (SEQ ID NO: 68); and probes 

10 having the complement of these nucleotide sequences, i.e., 
5TCCTCAACGCTGACCCGGTT 3* (SEQ ID NO: 69); 5' 
TCACTTCCTCAACGCTGACCCGGTTCTGGCA 3' (SEQ ID NO: 70). The bold 
nucleotides represents the location of the nucleotide polymorphism. In another 
embodiment, the probe overlapping nucleotide 41 of exon 8 is capable of specifically 

15 hybridizing to a nucleotide sequence wherein nucleotide 41 is a cytidine (as shown in 
Figure 2 and set forth in SEQ ID NO: 12). Examples of such probes include a probe having 
the nucleotide sequence 5' AACCGGGTCGGCGTTGATGA 3' (SEQ ID NO: 71); 
TGCCAGAACCGGGTCGGCGT 

TGATGAAGTGA 3' (SEQ ID NO: 72) and probes having the complement of these 
20 nucleotide sequences, i.e., 5' TCATCAACGCCGACCCGGTT 3* (SEQ ID NO: 73); 5' 

TCACTTCATCAACGCCGACCCGGTTCTGGCA 3' (SEQ ID NO: 74). 

Yet other preferred probes of the invention are capable of hybridizing specifically 

to a region overlapping nucleotide 54 of intron 5 of the human SR-BI gene. In one 

embodiment, the probe overlapping nucleotide 54 of intron 5 is capable of hybridizing 
25 specifically to a nucleotide sequence wherein nucleotide 54 is a cytidine (as shown in Figure 

2 and set forth in SEQ ID NOS. 9 and 26). Examples of such probes include a probe having 

the nucleotide sequence 5' AGCCATGGCCGGGCCCACCCT 3' (SEQ ID NO: 75); 5' 

CGAGCAGCCATG 

GCCGGGCCCACCCTCCCCT 3' (SEQ ID NO: 76) ; and probes having the complement 
30 of these nucleotide sequences, i.e., 5' AGGGTGGGCCCGGCCATGGCT 3 1 (SEQ ID NO: 
77); 5' AGGGGAGGGTGGGCCCGGCCATGGCTGCTCG 3' (SEQ ID NO: 78). In 
another embodiment, the probe overlapping nucleotide 54 of intron 5 is capable of 
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specifically hybridizing to a nucleotide sequence wherein nucleotide 54 is a thymidine (as 
shown in SEQ ID NO: 66). Examples of such probes include a probe having the nucleotide 
sequence 5' AGCCATGGCCAGGCCCACCCT 3' (SEQ ID NO: 79); 5' 
CGAGCAGCCATGGCCAG 

5 GCCC ACCCTCCCCT 3' (SEQ ID NO: 80) ; and probes having the complement of these 
nucleotide sequences, i.e., 5' AGGGTGGGCCTGGCCATGGCT 3' (SEQ ID NO: 81); 5' 
AGGGGAGGGTGGGCCTGGCCATGGCTGCTCG 3' (SEQ ID NO: 82). 

Still other preferred probes of the invention are capable of hybridizing specifically 
to a region overlapping nucleotide -41 of intron 10 of the human SR-BI gene. In one 

10 embodiment, the probe overlapping nucleotide -41 of intron 10 is capable of hybridizing 
specifically to a nucleotide sequence wherein nucleotide -41 is a guanine (as shown in SEQ 
ID NO: 97). Examples of such probes include a probe having the nucleotide sequence 5' 
TGGGGCCGCACGCTGCGGGCT 3' (SEQ ID NO: 114); 5' 
TGAGCTGGGGCCGCACGCTGCGGGCTACAGC 3' (SEQ ID NO: 115); and probes . 

15 having the complement of these nucleotide sequences, i.e., 5* 
AGCCCGCAGCGTGCGGCCCCA 3' (SEQ ID NO: 116); 5' 
GCTGTAGCCCGCAGCGTGCGGCCCCAGCTCA 3' (SEQ ID NO: 117). The bold 
nucleotides represents the location of the nucleotide polymorphism. In another 
embodiment, the probe overlapping nucleotide -41 of intron 10 is capable of specifically 

20 hybridizing to a nucleotide sequence wherein nucleotide -41 is a cytidine (as shown in 
Figure 2 and set forth in SEQ ID NO: 15). Examples of such probes include a probe having 
the nucleotide sequence 5' TGGGGCCGCAGGCTGCGGGCT 3' (SEQ ID NO: 118); 
TGAGCTGGGGCCGCAGGCTGCGGGCTACAGC 3' (SEQ ID NO: 1 19) and probes 
having the complement of these nucleotide sequences, i.e., 5' 

25 AGCCCGCAGCCTGCGGCCCCA 3' (SEQ ID NO: 120); 5' 
GCTGTAGCCCGCAGCCTGCGGCCCCAGCTCA 3' (SEQ ID NO: 121). 

Particularly, preferred probes of the invention have a number of nucleotides 
sufficient to allow specific hybridization to the target nucleotide sequence. Where the target 
nucleotide sequence is present in a large fragment of DNA, such as a genomic DNA 

30 fragment of several tens or hundreds of kilobases, the size of the probe may have to be 
longer to provide sufficiently specific hybridization, as compared to a probe which is used 
to detect a target sequence which is present in a shorter fragment of DNA. For example, 
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in some diagnostic methods, a portion of an SR-BI gene may first be amplified and thus 
isolated from the rest of the chromosomal DNA and then hybridized to a probe. In such a 
situation, a shorter probe will likely provide sufficient specificity of hybridization. For 
example, a probe having a nucleotide sequence of about 10 nucleotides may be sufficient. 

In preferred embodiments, the probe or primer further comprises a label attached 
thereto, which, e.g., is capable of being detected, e.g. the label group is selected from 
amongst radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

In a preferred embodiment of the invention, the isolated nucleic acid, which is used, 
e.g., as a probe or a primer, is modified, such as to become more stable. Exemplary nucleic 
acid molecules which are modified include phosphoramidate, phosphothioate and 
methylphosphonate analogs of DNA (see also U.S. Patents 5,176,996; 5,264,564; and 
5,256,775). 

The nucleic acids of the invention can also be modified at the base moiety, sugar 
moiety, or phosphate backbone, for example, to improve stability of the molecule. The 
nucleic acids, e.g., probes or primers, may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; 
Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
WO88/09810, published December 15, 1988), hybridization-triggered cleavage agents. 
(See, e.g., Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., 
Zon, 1988, Pharm. Res. 5:539-549). To this end, the nucleic acid of the invention may be 
conjugated to another molecule, e.g., a peptide, hybridization triggered cross-linking agent, 
transport agent, hybridization-triggered cleavage agent, etc. 

The isolated nucleic acid comprising an SR-BI intronic sequence may comprise at 
least one modified base moiety which is selected from the group including but not limited 
to 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4- 
acetylcytidine, 5 -(carboxy hydroxy methyl) uracil, 5-carboxymethylaminomethyl-2- 
thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytidine, 5-methylcytidine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
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isopentenyladenine, uraciI-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytidine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uraciI-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3- 
N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
5 The isolated nucleic acid may also comprise at least one modified sugar moiety 

selected from the group including but not limited to arabinose, 2-fluoroarabinose, xylulose, 
and hexose. 

In yet another embodiment, the nucleic acid comprises at least one modified 
phosphate backbone selected from the group consisting of a phosphorothioate, a 

10 phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriesten and a formacetal or analog thereof. 

In yet a further embodiment, the nucleic acid is an a-anomeric oligonucleotide. An 
a-anomeric oligonucleotide forms specific double-stranded hybrids with complementary 
RNA in which, contrary to the usual p-units, the strands run parallel to each other (Gautier 

15 et aL, 1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et aL 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric 
RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

Any nucleic acid fragment of the invention can be prepared according to methods 
well known in the art and described, e.g., in Sambrook, J. Fritsch, E.F., and Maniatis, T. 

20 (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y. For example, discrete fragments of the DNA can be prepared and 
cloned using restriction enzymes. Alternatively, discrete fragments can be prepared using 
the Polymerase Chain Reaction (PCR) using primers having an appropriate sequence. 

Oligonucleotides of the invention may be synthesized by standard methods known 

25 in the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate 
oligonucleotides may be synthesized by the method of Stein et al. (1988, Nucl. Acids Res. 
16:3209), methylphosphonate oligonucleotides can be prepared by use of controlled pore 
glass polymer supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. 

30 The invention also provides vectors and plasmids containing the nucleic acids of the 

invention. For example, in one embodiment, the invention provides a vector comprising 
at least a portion of an SR-BI gene comprising a polymorphic region and/or intronic 
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sequence. Thus, the invention provides vectors for expressing at least a portion of the 
newly identified allelic variants of the human SR-BI gene, as well as other allelic variants, 
having a nucleotide sequence which is different from the nucleotide sequence disclosed in 
Calvo and Vega, supra. The allelic variants can be expressed in eukaryotic cells, e.g., cells 
of a subject, or in prokaryotic cells. 

In one embodiment, the vector comprising at least a portion of an SR-BI allele is 
introduced into a host cell, such that a protein encoded by the allele is synthesized. The SR- 
BI protein produced can be used, e.g., for the production of antibodies, which can be used, 
e.g., in methods for detecting mutant forms of SR-BI. Alternatively, the vector can be used 
for gene therapy, and be, e.g., introduced into a subject to produce SR-BI protein. Host 
cells comprising a vector having at least a portion of an SR-BI gene are also within the 
scope of the invention. 

Polypeptides of the present invention 

The present invention makes available isolated SR-BI polypeptides, such as SR-BI 
polypeptides which are encoded by specific allelic variants of SR-BI, such as those 
identified herein. Accordingly, preferred SR-BI polypeptides of the invention have an 
amino acid sequence which differs from SEQ ID NOs 2 and 4. In one embodiment, the SR- 
BI polypeptides are isolated from, or otherwise substantially free of other cellular proteins. 
The term "substantially free of other cellular proteins" (also referred to herein as 
"contaminating proteins") or "substantially pure or purified preparations" are defined as 
encompassing preparations of SR-BI polypeptides having less than about 20% (by dry 
weight) contaminating protein, and preferably having less than about 5% contaminating 
protein. Functional forms of the subject polypeptides can be prepared, for the first time, as 
purified preparations by using a cloned gene as described herein. 

Preferred SR-BI proteins of the invention have an amino acid sequence which is at 
least about 60%, 70%, 80%, 85%, 90%, or 95% identical or homologous to an amino acid 
sequence of SEQ ID NO. 2 or 4. Even more preferred SR-BI proteins comprise an amino 
acid sequence which is at least about 97, 98, or 99% homologous or identical to an amino 
acid sequence of SEQ ID NO. 2 or 4. Such proteins can be recombinant proteins, and can 
be, e.g., produced in vitro from nucleic acids comprising a specific allele of an SR-BI 
polymorphic region. For example, recombinant polypeptides preferred by the present 
invention can be encoded by a nucleic acid, which is at least 85% homologous and more 
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preferably 90% homologous and most preferably 95 % homologous with a nucleotide 
sequence set forth in SEQ ID NOS. 1 or 3, and comprises an allele of a polymorphic region 
that differs from that set forth in SEQ ID Nos. 1 and 3. Polypeptides which are encoded by 
a nucleic acid that is at least about 98-99% homologous with the sequence of SEQ ID NOS: 

5 * 1 or 3 and comprises an allele of a polymorphic region that differs from that set forth in 
SEQ ID Nos. 1 and 3 are also within the scope of the invention. 

In a preferred embodiment, an SR-BI protein of the present invention is a 
mammalian SR-BI protein. In an even more preferred embodiment, the SR-BI protein is 
a human protein, such as an SR-BI polypeptide comprising an amino acid sequence from 

10 SEQ ID NO.2 in which amino acid 2 is a serine and/or amino acid 135 is an isoleucine. 
Other preferred SR-BI polypeptides comprise an amino acid sequence from SEQ ID NO. 
4 in which amino acid 2 is a serine. 

SR-BI polypeptides preferably are capable of functioning in one of either role of an 
agonist or antagonist of at least one biological activity of a wild-type ("authentic") SR-BI 

1 5 protein of the appended sequence listing. The term "evolutionary related to", with respect 
to amino acid sequences of SR-BI proteins, refers to both polypeptides having amino acid 
sequences which have arisen naturally, and also to mutational variants of human SR-BI 
polypeptides which are derived, for example, by combinatorial mutagenesis. 

Full length proteins or fragments corresponding to one or more particular motifs 

20 and/or domains or to arbitrary sizes, for example, at least 5, 10, 25, 50, 75 and 100, amino 
acids in length are within the scope of the present invention. 

Isolated peptidyl portions of SR-BI proteins can be obtained by screening peptides 
recombinantly produced from the corresponding fragment of the nucleic acid encoding such 
peptides. In addition, fragments can be chemically synthesized using techniques known in 

25 the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 
an SR-BI polypeptide of the present invention may be arbitrarily divided into fragments of 
desired length with no overlap of the fragments, or preferably divided into overlapping 
fragments of a desired length. The fragments can be produced (recombinantly or by 
chemical synthesis) and tested to identify those peptidyl fragments which can function as 

30 either agonists or antagonists of a wild-type (e.g., "authentic") SR-BI protein. 

In general, polypeptides referred to herein as having an activity (e.g., are "bioactive") 
of an SR-BI protein are defined as polypeptides which which mimic or antagonize all or a 
portion of the biological/biochemical activities of an SR-BI protein having SEQ ID NOs 
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2 or 4, such as the ability to bind lipids. Other biological activities of the subject SR-BI 
proteins are described herein or will be reasonably apparent to those skilled in the art. 
According to the present invention, a polypeptide has biological activity if it is a specific 
agonist or antagonist of a naturally-occurring form of an SR-BI protein. 

5 Assays for determining whether an SR-BI protein or variant thereof, has one or more 

biological activities are well known in the art. 

Other preferred proteins of the invention are those encoded by the nucleic acids set 
forth in the section pertaining to nucleic acids of the invention. In particular, the invention 
provides fusion proteins, e.g., SR-BI-immunoglobulin fusion proteins. Such fusion proteins 

1 0 can provide, e.g., enhanced stability and solubility of SR-BI proteins and may thus be useful 
in therapy. Fusion proteins can also be used to produce an immunogenic fragment of an 
SR-BI protein. For example, the VP6 capsid protein of rotavirus can be used as an 
immunologic carrier protein for portions of the SR-BI polypeptide, either in the monomelic 
form or in the form of a viral particle. The nucleic acid sequences corresponding to the 

1 5 portion of a subject SR-BI protein to which antibodies are to be raised can be incorporated 
into a fusion gene construct which includes coding sequences for a late vaccinia virus 
structural protein to produce a set of recombinant viruses expressing fusion proteins 
comprising SR-BI epitopes as part of the virion. It has been demonstrated with the use of 
immunogenic fusion proteins utilizing the Hepatitis B surface antigen fusion proteins that 

20 recombinant Hepatitis B virions can be utilized in this role as well. Similarly, chimeric 
constructs coding for fusion proteins containing a portion of an SR-BI protein and the 
poliovirus capsid protein can be created to enhance immunogenicity of the set of 
polypeptide antigens (see, for example, EP Publication No: 0259149; and Evans et al. 
(1989) Nature 339:385; Huang et al. (1988) J. Virol. 62:3855; and Schlienger et al. (1992) 

25 J. Virol. 66:2). 

The Multiple antigen peptide system for peptide-based immunization can also be 
utilized to generate an immunogen, wherein a desired portion of an SR-BI polypeptide is 
obtained directly from organo-chemical synthesis of the peptide onto an oligomeric 
branching lysine core (see, for example, Posnett et al. (1988) JBC 263:1719 and Nardelli 

30 et al. (1992) J. Immunol. 148:914). Antigenic determinants of SR-BI proteins can also be 
expressed and presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
appreciated that fusion proteins can also facilitate the expression of proteins, and 
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accordingly, can be used in the expression of the SR-BI polypeptides of the present 
invention. For example, SR-BI polypeptides can be generated as glutathione-S-transferase 
(GST-fusion) proteins. Such GST-flision proteins can enable easy purification of the SR-BI 
polypeptide, as for example by the use of glutathione-derivatized matrices (see, for 
5 example, Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & 
Sons, 1991)). 

The present invention further pertains to methods of producing the subject SR-BI 
polypeptides. For example, a host cell transfected with a nucleic acid vector directing 
expression of a nucleotide sequence encoding the subject polypeptides can be cultured 

10 under appropriate conditions to allow expression of the peptide to occur. Suitable media 
for cell culture are well known in the art. The recombinant SR-BI polypeptide can be 
isolated from cell culture medium, host cells, or both using techniques known in the art for 
purifying proteins including ion-exchange chromatography, gel filtration chromatography, 
ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies specific for 

1 5 such peptide. In a preferred embodiment, the recombinant SR-BI polypeptide is a fusion 
protein containing a domain which facilitates its purification, such as GST fusion protein. 

Moreover, it will be generally appreciated that, under certain circumstances, it may 
be advantageous to provide homologs of one of the subject SR-BI polypeptides which 
function in a limited capacity as one of either an SR-BI agonist (mimetic) or an SR-BI 

20 antagonist, in order to promote or inhibit only a subset of the biological activities of the 
naturally-occurring form of the protein. Thus, specific biological effects can be elicited by 
treatment with a homolog of limited function, and with fewer side effects relative to 
treatment with agonists or antagonists which are directed to all of the biological activities 
of naturally occurring forms of SR-BI proteins. 

25 Homologs of each of the subject SR-BI proteins can be generated by mutagenesis, 

such as by discrete point mutation(s), or by truncation. For instance, mutation can give rise 
to homologs which retain substantially the same, or merely a subset, of the biological 
activity of the SR-BI polypeptide from which it was derived. Alternatively, antagonistic 
forms of the protein can be generated which are able to inhibit the function of the naturally 

30 occurring form of the protein, such as by competitively binding to an SR-BI receptor. 

The recombinant SR-BI polypeptides of the present invention also include homologs 
of SR-BI polypeptides which differ from the SR-BI proteins having SEQ ID NO.2 or 4, 
such as versions of those protein which are resistant to proteolytic cleavage, as for example, 
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due to mutations which alter ubiquitination or other enzymatic targeting associated with the 
protein. 

SR-BI polypeptides may also be chemically modified to create SR-BI derivatives 
by forming covalent or aggregate conjugates with other chemical moieties, such as glycosyl 
5 groups, lipids, phosphate, acetyl groups and the like. Covalent derivatives of SR-BI 
proteins can be prepared by linking the chemical moieties to functional groups on amino 
acid sidechains of the protein or at the N-terminus or at the C-terminus of the polypeptide. 

Modification of the structure of the subject SR-BI polypeptides can be for such 

1 0 purposes as enhancing therapeutic or prophylactic efficacy, stability (e.g., ex vivo shelf life 
and resistance to proteolytic degradation), or post-translational modifications (e.g., to alter 
phosphorylation pattern of protein). Such modified peptides, when designed to retain at 
least one activity of the naturally-occurring form of the protein, or to produce specific 
antagonists thereof, are considered functional equivalents of the SR-BI polypeptides 

15 described in more detail herein. Such modified peptides can be produced, for instance, by 
amino acid substitution, deletion, or addition. The substitutional variant may be a 
substituted conserved amino acid or a substituted non-conserved amino acid. 

For example, it is reasonable to expect that an isolated replacement of a leucine with 
an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar 

20 replacement of an amino acid with a structurally related amino acid (i.e. isosteric and/or 
isoelectric mutations) will not have a major effect on the biological activity of the resulting 
molecule. Conservative replacements are those that take place within a family of amino 
acids that are related in their side chains. Genetically encoded amino acids can be divided 
into four families: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; 

25 (3) nonpolar = alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan; and (4) uncharged polar = glycine, asparagine, glutamine, cysteine, serine, 
threonine, tyrosine. In similar fashion, the amino acid repertoire can be grouped as (1) 
acidic = aspartate, glutamate; (2) basic = lysine, arginine histidine, (3) aliphatic = glycine, 
alanine, valine, leucine, isoleucine, serine, threonine, with serine and threonine optionally 

30 be grouped separately as aliphatic-hydroxyl; (4) aromatic = phenylalanine, tyrosine, 
tryptophan; (5) amide = asparagine, glutamine; and (6) sulfur -containing = cysteine and 
methionine, (see, for example, Biochemistry, 2 nd ed., Ed. by L. Stryer, WH Freeman and 
Co.: 1981). Whether a change in the amino acid sequence of a peptide results in a 
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functional SR-BI homolog (e.g., functional in the sense that the resulting polypeptide 
mimics or antagonizes the wild-type form) can be readily determined by assessing the 
ability of the variant peptide to produce a response in cells in a fashion similar to the wild- 
type protein, or competitively inhibit such a response. Polypeptides in which more than one 
5 replacement has taken place can readily be tested in the same manner. 

Kits 

As set forth herein, the invention provides methods, e.g., diagnostic and therapeutic 
methods, e.g., for determining the type of allelic variant of a polymorphic region present 

10 in an SR-BI gene, such as a human SR-BI gene. In preferred embodiments, the methods 
use probes or primers comprising nucleotide sequences which are complementary to an SR- 
BI intronic sequence or to a polymorphic region of an SR-BI gene. Accordingly, the 
invention provides kits for performing these methods. 

In a preferred embodiment, the invention provides a kit for determining whether a 

1 5 subject has or is at risk of developing a disease or condition associated with a specific allelic 
variant of an SR-BI polymorphic region. In an even more preferred embodiment, the 
disease or disorder is characterized by an abnormal SR-BI activity. In an even more 
preferred embodiment, the invention provides a kit for determining whether a subject has 
or is at risk of developing a cardiovascular disease, e.g., ischemia, restenosis, congestive 

20 heart failure, atherosclerosis, aberrant lipid (e.g., cholesterol), lipoprotein (e.g., HDL, LDL) 
or triglyceride levels, gallstone formation, or an abnormal body mass index, e.g, obesity or 
cachexia. A preferred kit provides reagents for determining whether a female subject is 
likely to develop high LDL levels or a high BMI or whether a male subject is likely to 
develop low HDL levels. 

25 Preferred kits comprise at least one probe or primer which is capable of specifically 

hybridizing to an SR-BI sequence or polymorphic region and instructions for use. The kits 
preferably comprise at least one of the above described nucleic acids, e.g., including nucleic 
acids hybridizing to an exon/intron border. Preferred kits for amplifying at least a portion 
of an SR-BI gene, e.g., an exon, comprise two primers, at least one of which is capable of 

30 hybridizing to an SR-BI intronic sequence or an allelic variant sequence. Even more 
preferred kits comprise a pair of primers selected from the group consisting of SEQ ID NO: 
41 and SEQ ID NO: 42, SEQ ID NO: 43 and SEQ ID NO: 44, SEQ ID NO: 45 and SEQ 
ID NO: 46, SEQ ID NO: 47 and SEQ ID NO: 48, SEQ ID NO: 49 and SEQ ID NO: 50, 
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SEQ ID NO: 5 1 and SEQ ID NO: 52, SEQ ID NO: 53 and SEQ ID NO: 54, SEQ ID NO: 
55 and SEQ ID NO: 56, SEQ ID NO: 57 and SEQ ID NO: 58, SEQ ID NO: 59 and SEQ 
ID NO: 60, SEQ ID NO: 61 and SEQ ID NO: 62, SEQ ID NO:63 and SEQ ID NO: 64, 
SEQ ID NO: 85 and SEQ ID NO: 86, SEQ ID NO:89 and SEQ ID NO: 90, SEQ ID NO:91 
and SEQ ID NO: 92, and SEQ ID NO:93 and SEQ ID NO: 94. 

The kits of the invention can also comprise one or more control nucleic acids or 
reference nucleic acids, such as nucleic acids comprising an SR-BI intronic sequence. For 
example, a kit can comprise primers for amplifying a polymorphic region of an SR-BI gene 
and a control DNA corresponding to such an amplified DNA and having the nucleotide 
sequence of a specific allelic variant. Thus, direct comparison can be performed between 
the DNA amplified from a subject and the DNA having the nucleotide sequence of a 
specific allelic variant. In one embodiment, the control nucleic acid comprises at least a 
portion of an SR-BI gene of an individual, who does not have a cardiovascular disease, 
aberrant lipid levels, gallstones, or a disease or disorder associated with an aberrant SR-BI 
activity. 

Yet other kits of the invention comprise at least one reagent necessary to perform 
the assay. For example, the kit can comprise an enzyme. Alternatively the kit can comprise 
a buffer or any other necessary reagent. 

Predictive Medicine 

The invention further features predictive medicines, which are based, at least in 
part, on determination of the identity of SR-BI polymorphic regions which are associated 
with specific diseases or disorders. 

For example, information obtained using the diagnostic assays described herein 
(alone or in conjunction with information on another genetic defect, which contributes to 
the same disease) is useful for diagnosing or confirming that a symptomatic subject has an 
allele of a polymorphic region which is associated with a particular disease or disorder. 
Alternatively, the information (alone or in conjunction with information on another genetic 
defect, which contributes to the same disease) can be used prognostically for predicting 
whether a non-symptomatic subject is likely to develop a disease or condition, which is 
associated with one or more specific alleles of SR-BI polymorphic regions in a subject. 
Based on the prognostic information, a doctor can recommend a regimen (e.g. diet or 
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exercise) or therapeutic protocol, useful for preventing or prolonging onset of the particular 
disease or condition in the individual. 

In addition, knowledge of the identity of a particular SR-BI allele in an individual 
(the SR-BI genetic profile), alone or in conjunction with information on other genetic 

5 defects contributing to the same disease (the genetic profile of the particular disease) allows 
customization of therapy for a particular disease to the individual's genetic profile, the goal 
of "pharmacogenomics". For example, an individual's SR-BI genetic profile or the genetic 
profile of a disease or condition associated with a specific allele of an SR-BI polymorphic 
region, can enable a doctor: 1) to more effectively prescribe a drug that will address the 

10 molecular basis of the disease or condition; and 2) to better determine the appropriate 
dosage of a particular drug. For example, the expression level of SR-BI proteins, alone or 
in conjunction with the expression level of other genes, known to contribute to the same 
disease, can be measured in many patients at various stages of the disease to generate a 
transcriptional or expression profile of the disease. Expression patterns of individual 

15 patients can then be compared to the expression profile of the disease to determine the 
appropriate drug and dose to administer to the patient. 

The ability to target populations expected to show the highest clinical benefit, based 
on the SR-BI or disease genetic profile, can enable: 1) the repositioning of marketed drugs 
with disappointing market results; 2) the rescue of drug candidates whose clinical 

20 development has been discontinued as a result of safety or efficacy limitations, which are 
patient subgroup-specific; and 3) an accelerated and less costly development for drug 
candidates and more optimal drug labeling (e.g. since the use of SR-BI as a marker is useful 
for optimizing effective dose). 

These and other methods are described in further detail in the following sections. 

25 

Prognostic and Diagnostic Assays 

The present methods provide means for determining if a subject has (diagnostic) 
or is at risk of developing (prognostic) a disease, condition or disorder that is associated a 
specific SR-BI allele, e.g., a body mass disorder or an abnormal lipid level (HDL and LDL) 
30 and disorders resulting therefrom. 

The present invention provides methods for determining the molecular structure of 
an SR-BI gene, such as a human SR-BI gene, or a portion thereof. In one embodiment, 
determining the molecular structure of at least a portion of an SR-BI gene comprises 
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determining the identity of the allelic variant of at least one polymorphic region of an SR-BI 
gene. A polymorphic region of an SR-BI gene can be located in an exon, an intron, at an 
intron/exon border, or in the promoter of the SR-BI gene. 

The invention provides methods for determining whether a subject has, or is at risk 

5 of developing, a disease or condition associated with a specific allelic variant of a 
polymorphic region of an SR-BI gene. Such diseases can be associated with an aberrant 
SR-BI activity, e.g., abnormal binding to a lipid, or an aberrant SR-BI protein level. An 
aberrant SR-BI protein level can result from an aberrant transcription or post transcriptional 
regulation. Thus, allelic differences in specific regions of an SR-BI gene can result in 

10 differences of SR-BI protein due to differences in regulation of expression. In particular, 
some of the identified polymorphisms in the human SR-BI gene may be associated with 
differences in the level of transcription, RNA maturation, splicing, or translation of the SR- 
BI gene or transcription product. {. 

Analysis of one or more SR-BI polymorphic region in a subject can be useful for 

1 5 predicting whether a subject has or is likely to develop a body mass disorder, an abnormal 
lipoprotein or lipid level and disorders resulting therefrom, such as cardiovascular disorders, 
diabetes and gallstone formation. 

In addition, since SR-BI is a receptor that is capable of binding to various lipid 
related molecules, it is likely that specific alleles of the SR-BI gene are associated with 

20 other diseases or conditions involving an inappropriate lipid transfer or metabolism, e.g., 
atherosclerosis or a biliary disorder, such as gallstone formation. Accordingly, the 
invention provides diagnostic and prognostic assays for determining whether a subject is 
at risk of developing a disease characterized by an abnormal lipid level, e.g, atherosclerosis 
or gall stone formation. 

25 In preferred embodiments, the methods of the invention can be characterized as 

comprising detecting, in a sample of cells from the subject, the presence or absence of a 
specific allelic variant of one or more polymorphic regions of an SR-BI gene. The allelic 
differences can be: (i) a difference in the identity of at least one nucleotide or (ii) a 
difference in the number of nucleotides, which difference can be a single nucleotide or 

30 several nucleotides. The invention also provides methods for detecting differences in SR-BI 
genes such as chromosomal rearrangements, e.g., chromosomal dislocation. The invention 
can also be used in prenatal diagnostics. 
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A preferred detection method is allele specific hybridization using probes 
overlapping the polymorphic site and having about 5, 10, 20, 25, or 30 nucleotides around 
the polymorphic region. Examples of probes for detecting specific allelic variants of the 
polymorphic region located in exon 1 are probes comprising a nucleotide sequence set forth 
5 in any of SEQ ID NO: 98-105; probes for detecting specific allelic variants of the 
polymorphic region located in exon 3 are probes comprising a nucleotide sequence set forth 
in any of SEQ ID NO: 106-1 13; and probes for detecting specific allelic variants of the 
polymorphic region located in exon 8 are probes comprising a nucleotide sequence set forth 
in any of SEQ ID NO: 67-74. Examples of probes for detecting specific allelic variants of 

10 the polymorphic region located in intron 5 are probes comprising a nucleotide sequence set 
forth in any of SEQ ID NO: 75-82; and probes for detecting specific allelic variants of the 
polymorphic region located in intron 10 are probes comprising a nucleotide sequence set 
forth in any of SEQ ID NO: 1 14-121 . In a preferred embodiment of the invention, several 
probes capable of hybridizing specifically to allelic variants are attached to a solid phase 

15 support, e.g., a "chip". Oligonucleotides can be bound to a solid support by a variety of 
processes, including lithography. For example a chip can hold up to 250,000 
oligonucleotides (GeneChip, Affymetrix). Mutation detection analysis using these chips 
comprising oligonucleotides, also termed "DNA probe arrays" is described e.g., in Cronin 
et al. (1996) Human Mutation 7:244. In one embodiment, a chip comprises all the allelic 

20 variants of at least one polymorphic region of a gene. The solid phase support is then 
contacted with a test nucleic acid and hybridization to the specific probes is detected. 
Accordingly, the identity of numerous allelic variants of one or more genes can be identified 
in a simple hybridization experiment. For example, the identity of the allelic variant of the 
nucleotide polymorphism in exons 1, 3, 8 or in introns 5 and 10 can be determined in a 

25 single hybridization experiment. 

In other detection methods, it is necessary to first amplify at least a portion of an 
SR-BI gene prior to identifying the allelic variant. Amplification can be performed, e.g., 
by PCR and/or LCR, according to methods known in the art. In one embodiment, genomic 
DNA of a cell is exposed to two PCR primers and amplification for a number of cycles 

30 sufficient to produce the required amount of amplified DNA. In preferred embodiments, 
the primers are located between 1 50 and 350 base pairs apart. Preferred primers, such as 
primers for amplifying each of the exons of the human SR-BI gene, are listed in Table VI 
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in the Examples. Details regarding the PCR reaction are indicated in Table V, also in the 
Examples. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli, J.C. et al. 5 1990, Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al., 1989, Proc. Natl. Acad. Sci. USA 86:1 173-1 177), 
Q-Beta Replicase (Lizardi, P.M. et al., 1988, Bio/Technology 6: 1 197), or any other nucleic 
acid amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In one embodiment, any of a variety of sequencing reactions known in the art can 
be used to directly sequence at least a portion of an SR-BI gene and detect allelic variants, 
e.g., mutations, by comparing the sequence of the sample sequence with the corresponding 
wild-type (control) sequence. Exemplary sequencing reactions include those based on 
techniques developed by Maxam and Gilbert (Proc. Natl Acad Sci USA (1977) 74:560) or 
Sanger (Sanger et al (1977) Proc. Nat. Acad Sci 74:5463). It is also contemplated that any 
of a variety of automated sequencing procedures may be utilized when performing the 
subject assays (Biotechniques (1995) 19:448), including sequencing by mass spectrometry 
(see, for example, U.S. Patent No. 5,547,835 and international patent application 
Publication Number WO 94/16101, entitled DNA Sequencing by Mass Spectrometry by H. 
Koster; U.S. Patent No. 5,547,835 and international patent application Publication Number 
WO 94/21822 entitled "DNA Sequencing by Mass Spectrometry Via Exonuclease 
! Degradation" by H. Koster), and U.S Patent No.5,605,798 and International Patent 
Application No. PCT/US96/0365 1 entitled DNA Diagnostics Based on Mass Spectrometry 
by H. Koster;. Cohen et al. (1996) Adv Chromatogr 36:127-162; and Griffin et al. (1993) 
Appl Biochem Biotechnol 38:147-159). It will be evident to one skilled in the art that, for 
certain embodiments, the occurrence of only one, two or three of the nucleic acid bases need 
be determined in the sequencing reaction. For instance, A-track or the like, e.g., where only 
one nucleotide is detected, can be carried out. 

Yet other sequencing methods are disclosed, e.g., in U.S. Patent No. 5,580,732 
entitled "Method of DNA sequencing employing a mixed DNA-polymer chain probe" and 
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U.S. Patent No. 5,571,676 entitled "Method for mismatch-directed in vitro DNA 
sequencing". 

In some cases, the presence of a specific allele of an SR-BI gene in DNA from a 
subject can be shown by restriction enzyme analysis. For example, a specific nucleotide 

5 polymorphism can result in a nucleotide sequence comprising a restriction site which is 
absent from the nucleotide sequence of another allelic variant. In particular, the presence 
of a cytidine at position 54 of intron 5 creates an Apal site, whereas the presence of a 
thymidine, at this position destroys the Apal site. Similarly, the polymorphism of exon 1 
and exon 8 can be determined by analyzing the products or restriction digests (see Table 

10 VI). 

In a further embodiment, protection from cleavage agents (such as a nuclease, 
hydroxylamine or osmium tetroxide and with piperidine) can be used to detect mismatched 
bases in RNA/RNA DNA/DNA, or RNA/DNA heteroduplexes (Myers, et al. (1985) Science 
230:1242). In general, the technique of "mismatch cleavage" starts by providing 

1 5 heteroduplexes formed by hybridizing a control nucleic acid, which is optionally labeled, 
e.g., RNA or DNA, comprising a nucleotide sequence of an SR-BI allelic variant with a 
sample nucleic acid, e.g, RNA or DNA, obtained from a tissue sample. The double- 
stranded duplexes are treated with an agent which cleaves single-stranded regions of the 
duplex such as duplexes formed based on basepair mismatches between the control and 

20 sample strands. For instance, RNA/DNA duplexes can be treated with RNase and 
DNA/DNA hybrids treated with SI nuclease to enzymatically digest the mismatched 
regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can be treated 
with hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then separated 

25 by size on denaturing polyacrylamide gels to determine whether the control and sample 
nucleic acids have an identical nucleotide sequence or in which nucleotides they are 
different. See, for example, Cotton et al (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba 
et al (1992) Methods Enzymod. 217:286-295. In a preferred embodiment, the control or 
sample nucleic acid is labeled for detection. 

30 In other embodiments, alterations in electrophoretic mobility is used to identify the 

type of SR-BI allelic variant. For example, single strand conformation polymorphism 
(SSCP) may be used to detect differences in electrophoretic mobility between mutant and 
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wild type nucleic acids (Orita et al. (1989) Proc Natl Acad. Sci USA 86:2766, see also 
Cotton (1993) Mutat Res 285:125-144; and Hayashi (1992) Genet Anal TechAppl 9:73-79). 
Single-stranded DNA fragments of sample and control nucleic acids are denatured and 
allowed to renature. The secondary structure of single-stranded nucleic acids varies 
5 according to sequence, the resulting alteration in electrophoretic mobility enables the 
detection of even a single base change. The DNA fragments may be labeled or detected 
with labeled probes. The sensitivity of the assay may be enhanced by using RNA (rather 
than DNA), in which the secondary structure is more sensitive to a change in sequence. In 
another preferred embodiment, the subject method utilizes heteroduplex analysis to separate 
10 double stranded heteroduplex molecules on the basis of changes in electrophoretic mobility 
(Keen et al. (1991) Trends Genet 7:5). 

In yet another embodiment, the identity of an allelic variant of a polymorphic 
region is obtained by analyzing the movement of a nucleic acid comprising the 
polymorphic region in polyacrylamide gels containing a gradient of denaturant is assayed 
1 5 using denaturing gradient gel electrophoresis (DGGE) (Myers et al ( 1 985) Nature 3 1 3 :495). 
When DGGE is used as the method of analysis, DNA will be modified to insure that it does 
not completely denature, for example by adding a GC clamp of approximately 40 bp of 
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing agent gradient to identify differences in the mobility of control 
20 and sample DNA (Rosenbaum and Reissner (1987) Biophys Chem 265:1275). 

Examples of techniques for detecting differences of at least one nucleotide between 
2 nucleic acids include, but are not limited to, selective oligonucleotide hybridization, 
selective amplification, or selective primer extension. For example, oligonucleotide probes 
may be prepared in which the known polymorphic nucleotide is placed centrally (allele- 
25 specific probes) and then hybridized to target DNA under conditions which permit 
hybridization only if a perfect match is found (Saiki et al. (1986) Nature 324:163); Saiki et 
al (1989) Proc. Natl Acad Sci USA 86:6230; and Wallace et al. (1979) Nucl. Acids Res. 
6:3543). Such allele specific oligonucleotide hybridization techniques may be used for the 
simultaneous detection of several nucleotide changes in different polylmorphic regions of 
30 SR-BI. For example, oligonucleotides having nucleotide sequences of specific allelic 
variants are attached to a hybridizing membrane and this membrane is then hybridized with 
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labeled sample nucleic acid. Analysis of the hybridization signal will then reveal the 
identity of the nucleotides of the sample nucleic acid. 

Alternatively, allele specific amplification technology which depends on selective 
PCR amplification may be used in conjunction with the instant invention. Oligonucleotides 

5 used as primers for specific amplification may carry the allelic variant of interest in the 
center of the molecule (so that amplification depends on differential hybridization) (Gibbs 
et al (1 989) Nucleic Acids Res. 1 7:2437-2448) or at the extreme 3' end of one primer where, 
under appropriate conditions, mismatch can prevent, or reduce polymerase extension 
(Prossner (1993) Tibtech 1 1 :238; Newton et al. (1989) Nucl. Acids Res. 17:2503). This 

1 0 technique is also termed "PROBE" for Probe Oligo Base Extension. In addition it may be 
desirable to introduce a novel restriction site in the region of the mutation to create 
cleavage-based detection (Gasparini et al (1992) Mol. Cell Probes 6:1). 

In another embodiment, identification of the allelic variant is carried out using an 
oligonucleotide ligation assay (OLA), as described, e.g., in U.S. Pat. No. 4,998,617 and in 

15 Landegren, U. et al., Science 241:1077-1080 (1988). The OLA protocol uses two 
oligonucleotides which are designed to be capable of hybridizing to abutting sequences of 
a single strand of a target. One of the oligonucleotides is linked to a separation marker, e.g,. 
biotinylated, and the other is detectably labeled. If the precise complementary sequence is 
found in a target molecule, the oligonucleotides will hybridize such that their termini abut, 

20 and create a ligation substrate. Ligation then permits the labeled oligonucleotide to be 
recovered using avidin, or another biotin ligand. Nickerson, D. A. et al. have described a 
nucleic acid detection assay that combines attributes of PCR and OLA (Nickerson, D. A. 
et al., Proc. Natl. Acad. Sci. (U.S.A.) 87:8923-8927 (1990). In this method, PCR is used to 
achieve the exponential amplification of target DNA, which is then detected using OLA. 

25 

Several techniques based on this OLA method have been developed and can be 
used to detect specific allelic variants of a polymorphic region of an SR-BI gene. For 
example, U.S. Patent No. 5593826 discloses an OLA using an oligonucleotide having 
3'-amino group and a 5'-phosphorylated oligonucleotide to form a conjugate having a 
30 phosphoramidate linkage. In another variation of OLA described in Tobe et al. 
((1996)Nucleic Acids Res 24: 3728), OLA combined with PCR permits typing of two 
alleles in a single microtiter well. By marking each of the allele-specific primers with a 
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unique hapten, i.e. digoxigenin and fluorescein, each OLA reaction can be detected by using 
hapten specific antibodies that are labeled with different enzyme reporters, alkaline 
phosphatase or horseradish peroxidase. This system permits the detection of the two alleles 
using a high throughput format that leads to the production of two different colors. 

The invention further provides methods for detecting single nucleotide 
polymorphisms in an SR-BI gene. Because single nucleotide polymorphisms constitute 
sites of variation flanked by regions of invariant sequence, their analysis requires no more 
than the determination of the identity of the single nucleotide present at the site of variation 
and it is unnecessary to determine a complete gene sequence for each patient. Several 
methods have been developed to facilitate the analysis of such single nucleotide 
polymorphisms. 

In one embodiment, the single base polymorphism can be detected by using a 
specialized exonuclease-resistant nucleotide, as disclosed, e.g., in Mundy, C. R. (U.S. Pat. 
No.4,656,127). According to the method, a primer complementary to the allelic sequence 
immediately 3' to the polymorphic site is permitted to hybridize to a target molecule 
obtained from a particular animal or human. If the polymorphic site on the target molecule 
contains a nucleotide that is complementary to the particular exonuclease-resistant 
nucleotide derivative present, then that derivative will be incorporated onto the end of the 
hybridized primer. Such incorporation renders the primer resistant to exonuclease, and 
thereby permits its detection. Since the identity of the exonuclease-resistant derivative of 
the sample is known, a finding that the primer has become resistant to exonucleases reveals 
that the nucleotide present in the polymorphic site of the target molecule was 
complementary to that of the nucleotide derivative used in the reaction. This method has the 
advantage that it does not require the determination of large amounts of extraneous 
sequence data. 

In another embodiment of the invention, a solution-based method is used for 
determining the identity of the nucleotide of a polymorphic site. Cohen, D. et al. (French 
Patent 2,650,840; PCT Appln. No. W09 1/02087). As in the Mundy method of U.S. Pat. 
No. 4,656,127, a primer is employed that is complementary to allelic sequences 
immediately 3 f to a polymorphic site. The method determines the identity of the nucleotide 
of that site using labeled dideoxynucleotide derivatives, which, if complementary to the 
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nucleotide of the polymorphic site will become incorporated onto the terminus of the 
primer. 

An alternative method, known as Genetic Bit Analysis or GBA™ is described by 
Goelet, P. et al. (PCT Appln. No. 92/15712). The method of Goelet, P. et al. uses mixtures 

5 of labeled terminators and a primer that is complementary to the sequence 3' to a 
polymorphic site. The labeled terminator that is incorporated is thus determined by, and 
complementary to, the nucleotide present in the polymorphic site of the target molecule 
being evaluated. In contrast to the method of Cohen et al. (French Patent 2,650,840; PCT 
Appln. No. W09 1/02087) the method of Goelet, P. et al. is preferably a heterogeneous 

10 phase assay, in which the primer or the target molecule is immobilized to a solid phase. 

Recently, several primer-guided nucleotide incorporation procedures for assaying 
polymorphic sites in DNA have been described (Komher, J. S. et al., NucL Acids. Res. 
17:7779-7784 (1989); Sokolov, B. P., Nucl. Acids Res. 18:3671 (1990); Syvanen, A. -C, 
et al., Genomics 8:684-692 (1990); Kuppuswamy, M. N. et al., Proc. Natl. Acad. Sci. 

15 (U.S.A.) 88:1143-1147 (1991); Prezant, T. R. et ah, Hum. Mutat. 1:159-164 (1992); 
Ugozzoli, L. et al., GATA 9:107-1 12 (1992); Nyren, P. et al., Anal. Biochem. 208:171-175 
(1993)). These methods differ from GBA™ in that they all rely on the incorporation of 
labeled deoxynucleotides to discriminate between bases at a polymorphic site. In such a 
format, since the signal is proportional to the number of deoxynucleotides incorporated, 

20 polymorphisms that occur in runs of the same nucleotide can result in signals that are 
proportional to the length of the run (Syvanen, A. -C, et al., Amer.J. Hum. Genet. 52:46-59 
(1993)). 

For determining the identity of the allelic variant of a polymorphic region located 
in the coding region of an SR-BI gene, yet other methods than those described above can 

25 be used. For example, identification of an allelic variant which encodes a mutated SR-BI 
protein can be performed by using an antibody specifically recognizing the mutant protein 
in, e.g., immunohistochemistry or immunoprecipitation. Antibodies to wild-type SR-BI 
protein are described, e.g, in Acton et al. (1999) Science 271:518 (anti-mouse SR-BI 
antibody cross-reactive with human SR-BI). Other antibodies to wild-type SR-BI or 

30 mutated forms of SR-BI proteins can be prepared according to methods known in the art. 
Preferred antibodies specifically bind to a human SR-BI protein having a serine at residue 
2 and/or having an isoleucine at amino acid residue 135. Alternatively, one can also 
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measure an activity of an SR-BI protein, such as binding to a lipid or lipoprotein. Binding 
assays are known in the art and involve, e.g., obtaining cells from a subject, and performing 
binding experiments with a labeled lipid, to determine whether binding to the mutated form 
of the receptor differs from binding to the wild-type of the receptor. 

5 Antibodies directed against wild type or mutant SR-B1 polypeptides or allelic 

variant thereof, which are discussed above, may also be used in disease diagnostics and 
prognostics. Such diagnostic methods, may be used to detect abnormalities in the level of 
SR-BI polypeptide expression, or abnormalities in the structure and/or tissue, cellular, or 
subcellular location of an SR-BI polypeptide. Structural differences may include, for 

10 example, differences in the size, electronegativity, or antigenicity of the mutant SR-BI 
polypeptide relative to the normal SR-BI polypeptide. Protein from the tissue or cell type 
to be analyzed may easily be detected or isolated using techniques which are well known 
to one of skill in the art, including but not limited to western blot analysis. For a detailed 
explanation of methods for carrying out Western blot analysis, see Sambrook et al, 1989, 

15 supra, at Chapter 1 8. The protein detection and isolation methods employed herein may 
also be such as those described in Harlow and Lane, for example, (Harlow, E. and Lane, D., 
1988, "Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York), which is incorporated herein by reference in its entirety. 

This can be accomplished, for example, by immunofluorescence techniques 

20 employing a fluorescently labeled antibody (see below) coupled with light microscopic, 
flow cytometric, or fluorimetric detection. The antibodies (or fragments thereof) useful in 
the present invention may, additionally, be employed histologically, as in 
immunofluorescence or immunoelectron microscopy, for in situ detection of SR-BI 
polypeptides. In situ detection may be accomplished by removing a histological specimen 

25 from a patient, and applying thereto a labeled antibody of the present invention. The 
antibody (or fragment) is preferably applied by overlaying the labeled antibody (or 
fragment) onto a biological sample. Through the use of such a procedure, it is possible to 
determine not only the presence of the SR-BI polypeptide, but also its distribution in the 
examined tissue. Using the present invention, one of ordinary skill will readily perceive 

30 that any of a wide variety of histological methods (such as staining procedures) can be 
modified in order to achieve such in situ detection. 

Often a solid phase support or carrier is used as a support capable of binding an 
antigen or an antibody. Well-known supports or carriers include glass, polystyrene, 
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polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble to 
some extent or insoluble for the purposes of the present invention. The support material 
may have virtually any possible structural configuration so long as the coupled molecule 

5 is capable of binding to an antigen or antibody. Thus, the support configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, or the external 
surface of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. 
Preferred supports include polystyrene beads. Those skilled in the art will know many other 
suitable carriers for binding antibody or antigen, or will be able to ascertain the same by use 

1 0 of routine experimentation. 

One means for labeling an anti-SR-BI polypeptide specific antibody is via linkage 
to an enzyme and use in an enzyme immunoassay (EIA) (Voller, "The Enzyme Linked 
Immunosorbent Assay (ELISA)", Diagnostic Horizons 2:1-7, 1978, Microbiological 
Associates Quarterly Publication, Walkersville, MD; Voller, et al., J. Clin. Pathol. 31:507- 

15 520 (1978); Butler, Meth. Enzymol. 73:482-523 (1981); Maggio, (ed.) Enzyme 
Immunoassay, CRC Press, Boca Raton, FL, 1980; Ishikawa, et al., (eds.) Enzyme 
Immunoassay, Kgaku Shoin, Tokyo, 1981). The enzyme which is bound to the antibody 
will react with an appropriate substrate, preferably a chromogenic substrate, in such a 
manner as to produce a chemical moiety which can be detected, for example, by 

20 spectrophoto metric, fluorimetric or by visual means. Enzymes which can be used to 
detectably label the antibody include, but are not limited to, malate dehydrogenase, 
staphylococcal nuclease, delta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha- 
glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxidase, 
alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, 

25 urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase and 
acetylcholinesterase. The detection can be accomplished by colorimetric methods which 
employ a chromogenic substrate for the enzyme. Detection may also be accomplished by 
visual comparison of the extent of enzymatic reaction of a substrate in comparison with 
similarly prepared standards. 

30 Detection may also be accomplished using any of a variety of other immunoassays. 

For example, by radioactively labeling the antibodies or antibody fragments, it is possible 
to detect fingerprint gene wild type or mutant peptides through the use of a 
radioimmunoassay (RIA) (see, for example, Weintraub, B., Principles of 
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Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, The 

Endocrine Society, March, 1986, which is incorporated by reference herein). The 

radioactive isotope can be detected by such means as the use of a gamma counter or a 

scintillation counter or by autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the 

fluorescently labeled antibody is exposed to light of the proper wave length, its presence can 

then be detected due to fluorescence. Among the most commonly used fluorescent labeling 

compounds are fluorescein isothiocyanate, rhodamine, phycoerythrin, phycocyanin, 

allophycocyanin, o-phthaldehyde and fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting metals 
1 52 

such as Eu, or others of the lanthanide series. These metals can be attached to the 
antibody using such metal chelating groups as diethylenetriaminepentacetic acid (DTP A) 
or ethylenediaminetetraacetic acid (EDTA). 

The antibody also can be detectably labeled by coupling it to a chemi luminescent 
compound. The presence of the chemi luminescent-tagged antibody is then determined by 
detecting the presence of luminescence that arises during the course of a chemical reaction. 
Examples of particularly useful chemiluminescent labeling compounds are luminol, 
isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label the antibody of the 
present invention. Bioluminescence is a type of chemiluminescence found in biological 
systems in. which a catalytic protein increases the efficiency of the chemiluminescent 
reaction. The presence of a bioluminescent protein is determined by detecting the presence 
of luminescence. Important bioluminescent compounds for purposes of labeling are 
luciferin, luciferase and aequorin. 

Moreover, it will be understood that any of the above methods for detecting 
alterations in a gene or gene product or polymorphic variants can be used to monitor the 
course of treatment or therapy. 

If a polymorphic region is located in an exon, either in a coding or non-coding 
portion of the gene, the identity of the allelic variant can be determined by determining the 
molecular structure of the mRNA, pre-mRNA, or cDNA. The molecular structure can be 
determined using any of the above described methods for determining the molecular 
structure of the genomic DNA, e.g., sequencing and SSCP. 
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The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits, such as those described above, comprising at least one probe or 
primer nucleic acid described herein, which may be conveniently used, e.g., to determine 
whether a subject has or is at risk of developing a disease associated with a specific SR-BI 
5 allelic variant. 

Sample nucleic acid for using in the above-described diagnostic and prognostic 
methods can be obtained from any cell type or tissue of a subject. For example, a subject's 
bodily fluid (e.g. blood) can be obtained by known techniques (e.g. venipuncture). 
Alternatively, nucleic acid tests can be performed on dry samples (e.g. hair or skin). Fetal 
10 nucleic acid samples can be obtained from maternal blood as described in International 
Patent Application No. W09 1/07660 to Bianchi. Alternatively, amniocytes or chorionic 
villi may be obtained for performing prenatal testing. 

Diagnostic procedures may also be performed in situ directly upon tissue sections 
(fixed and/or frozen) of patient tissue obtained from biopsies or resections, such that no 
1 5 nucleic acid purification is necessary. Nucleic acid reagents may be used as probes and/or 
primers for such in situ procedures (see, for example, Nuovo, G.J., 1992, PCR in situ 
hybridization: protocols and applications, Raven Press, NY). 

In addition to methods which focus primarily on the detection of one nucleic acid 
sequence, profiles may also be assessed in such detection schemes. Fingerprint profiles 
20 may be generated, for example, by utilizing a differential display procedure, Northern 
analysis and/or RT-PCR. 

Pharmacogenomics 

Knowledge of the identity of the allele of one or more SR-BI gene polymorphic 
25 regions in an individual (the SR-BI genetic profile), alone or in conjunction with 
information on other genetic defects contributing to the same disease (the genetic profile 
of the particular disease) allows a customization of the therapy for a particular disease to 
the individual's genetic profile, the goal of "pharmacogenomics". For example, subjects 
having a specific allele of an SR-BI gene may or may not exhibit symptoms of a particular 
30 disease or be predisposed to developing symptoms of a particular disease. Further, if those 
subjects are symptomatic, they may or may not respond to a certain drug, e.g., a specific 
SR-BI therapeutic, but may respond to another. Thus, generation of an SR-BI genetic 
profile, (e.g., categorization of alterations in SR-BI genes which are associated with the 
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development of a particular disease), from a population of subjects, who are symptomatic 
for a disease or condition that is caused by or contributed to by a defective and/or deficient 
SR-BI gene and/or protein (an SR-BI genetic population profile) and comparison of an 
individual's SR-BI profile to the population profile, permits the selection or design of drugs 

5 that are expected to be safe and efficacious for a particular patient or patient population (i.e., 
a group of patients having the same genetic alteration). 

For example, an SR-BI population profile can be performed by determining the SR- 
BI profile, e.g., the identity of SR-BI alleles, in a patient population having a disease, which 
is associated with one or more specific alleles of SR-BI polymorphic regions. Optionally, 

10 the SR-BI population profile can further include information relating to the response of the 
population to an SR-BI therapeutic, using any of a variety of methods, including, 
monitoring: 1) the severity of symptoms associated with the SR-BI related disease, 2) SR- 
BI gene expression level, 3) SR-BI mRNA level, and/or 4) SR-BI protein level, and (iii) 
dividing or categorizing the population based on particular SR-BI alleles. The SR-BI 

1 5 genetic population profile can also, optionally, indicate those particular SR-BI alleles which 
are present in patients that are either responsive or non-responsive to a particular 
therapeutic. This information or population profile, is then useful for predicting which 
individuals should respond to particular drugs, based on their individual SR-BI profile. 

In a preferred embodiment, the SR-BI profile is a transcriptional or expression level 

20 profile and step (i) is comprised of determining the expression level of SR-BI proteins, 
alone or in conjunction with the expression level of other genes known to contribute to the 
same disease at various stages of the disease. 

Pharmacogenomic studies can also be performed using transgenic animals. For 
example, one can produce transgenic mice, e.g., as described herein, which contain a 

25 specific allelic variant of an SR-BI gene. These mice can be created, e.g, by replacing their 
wild-type SR-BI gene with an allele of the human SR-BI gene. The response of these mice 
to specific SR-BI therapeutics can then be determined. 

Monitoring Ef fects of SR-BI Therapeutics During Clinical Trials 
30 The ability to target populations expected to show the highest clinical benefit, based 

on the SR-BI or disease genetic profile, can enable: 1) the repositioning of marketed drugs 
with disappointing market results; 2) the rescue of drug candidates whose clinical 
development has been discontinued as a result of safety or efficacy limitations, which are 
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patient subgroup-specific; and 3) an accelerated and less costly development for drug 
candidates and more optimal drug labeling (e.g. since the use of SR-BI as a marker is useful 
for optimizing effective dose). 

In situations in which the disease associated with a specific SR-BI allele is 

5 characterized by an abnormal SR-BI expression, the treatment of an individual with an SR- 
BI therapeutic can be monitored by determining SR-BI characteristics, such as SR-BI 
protein level or activity, SR-BI mRNA level, and/or SR-BI transcriptional level. This 
measurement will indicate whether the treatment is effective or whether it should be 
adjusted or optimized. Thus, SR-BI can be used as a marker for the efficacy of a drug 

1 0 during clinical trials. 

In a preferred embodiment, the present invention provides a method for monitoring 
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) comprising the steps of (i) obtaining a 

1 5 preadministration sample from a subject prior to administration of the agent; (ii) detecting 
the level of expression of an SR-BI protein, mRNA, or genomic DNA in the 
preadministration sample; (iii) obtaining one or more post-administration samples from the 
subject; (iv) detecting the level of expression or activity of the SR-BI protein, mRNA, or 
genomic DNA in the post-administration samples; (v) comparing the level of expression or 

20 activity of the SR-BI protein, mRNA, or genomic DNA in the preadministration sample 
with the SR-BI protein, mRNA, or genomic DNA in the post administration sample or 
samples; and (vi) altering the administration of the agent to the subject accordingly. For 
example, increased administration of the agent may be desirable to increase the expression 
or activity of SR-BI to higher levels than detected, i.e., to increase the effectiveness of the 

25 agent. Alternatively, decreased administration of the agent may be desirable to decrease 
expression or activity of SR-BI to lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

Cells of a subject may also be obtained before and after administration of an SR-BI 
therapeutic to detect the level of expression of genes other than SR-BI, to verify that the SR- 

30 BI therapeutic does not increase or decrease the expression of genes which could be 
deleterious. This can be done, e.g., by using the method of transcriptional profiling. Thus, 
mRNA from cells exposed in vivo to an SR-BI therapeutic and mRNA from the same type 
of cells that were not exposed to the SR-BI therapeutic could be reverse transcribed and 
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hybridized to a chip containing DNA from numerous genes, to thereby compare the 
expression of genes in cells treated and not treated with an SR-BI therapeutic. If, for 
example an SR-BI therapeutic turns on the expression of a proto-oncogene in an individual, 
use of this particular SR-BI therapeutic may be undesirable. 

5 

Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 
of treating a subject having or likely to develop a disorder associated with specific SR-BI 
alleles and/or aberrant SR-BI expression or activity, e.g., disorders or diseases associated 
10 with an abnormal BMI or lipid levels. 

Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a disease 
or condition associated with a specific SR-BI allele and/or an aberrant SR-BI expression or 

15 activity, such as a body mass disorder or abnormal lipid level and medical conditions 
resulting therefrom, by administering to the subject an agent which counteracts the 
unfavorable biological effect of the specific SR-BI allele. Subjects at risk for such a disease 
can be identified by a diagnostic or prognostic assay, e.g., as described herein. 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 

20 associated with specific SR-BI alleles, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending on the identity of the SR-BI allele in 
a subject, a compound that counteracts the effect of this allele is administered. The 
compound can be a compound modulating the plasma level of lipids. The treatment can 
also be a specific diet. In particular, the treatment can be undertaken prophylactically, 

25 before any other symptoms are present. Such a prophylactic treatment could thus prevent 
the development of an abnormal BMI or lipid level, e.g., abnormally high LDL level or 
abnormally low HDL. The prophylactic methods are similar to therapeutic methods of the 
present invention and are further discussed in the following subsections. 

30 Therapeutic Methods 

The invention further provides methods of treating subjects having a disease or 
disorder associated with a specific allelic variant of a polymorphic region of an SR-BI gene. 
Preferred diseases or disorders include those associated with an abnormal body mass, or 
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abnormal lipoprotein (LDL and HDL) levels and disorders resulting therefrom (e.g. 
cardiovascular disease, obesity, cachexia and gallstone formation). In one embodiment, the 
method comprises (a) determining the identity of the allelic variant; and (b) administering 
to the subject a compound that compensates for the effect of the specific allelic variant. The 
5 polymorphic region can be localized at any location of the gene, e.g., in the promoter (e.g., 
in a regulatory element of the promoter), in an exon, (e.g., coding region of an exon), in an 
intron, or at an exon/intron border. Thus, depending on the site of the polymorphism in the 
SR-BI gene, a subject having a specific variant of the polymorphic region which is 
associated with a specific disease or condition, can be treated with compounds which 
10 specifically compensate for the allelic variant. 

In a preferred embodiment, the identity of one or more of the following nucleotides 
of an SR-BI gene of a subject is determined: nucleotide 146 of exon 1, nucleotide 1 19 of 
exon 3, nucleotide 41 of exon 8, nucleotide 54 of intron 5, and nucleotide -41 of intron 10. 
If a female subject has the more common allele of residue 41 of exon 8, high LDL levels 
15 and resulting cardiovascular disorders can be prevented from occurring or can be reduced, 
by administering to the subject a pharmaceutical ly effective amount of a compound to 
reduce LDL level to a normal LDL level. Similarly, if a female subject has the less 
common allele of residue 54 of intron 5, a high BMI and/or high LDL level and 
consequences thereof, such as diabetes and cardiovascular disorders, can be prevented from 
20 occurring or reduced, by administering to the subject a pharmaceutically effective amount 
of a compound to reduce the BMI. In another embodiment of the invention, if a male 
subject has the more common allele at residue 41 of exon 8, the more common allele at 
residue 54 of intron 5, and the more common allele at residue 146 of exon 1, development 
of low HDL levels can be prevented or increased by administering to the subject a 
25 pharmaceutically effective amount of a compound that increases HDL levels. 

Generally, the allelic variant can be a mutant allele, i.e., an allele which when 
present in one, or preferably two copies, in a subject results in a change in the phenotype 
of the subject. A mutation can be a substitution, deletion, and/or addition of at least one 
nucleotide relative to the wild-type allele. Depending on where the mutation is located in 
30 the SR-BI gene, the subject can be treated to specifically compensate for the mutation. For 
example, if the mutation is present in the coding region of the gene and results in an inactive 
or less active SR-BI protein, the subject can be treated, e.g., by administration to the subject 
of a nucleic acid encoding a wild-type SR-BI protein, such that the expression of the wild- 



WO 99/02735 



PCT/US98/14354 



-61 - 

type SR-BI protein compensates for the endogenous mutated form of the SR-BI protein. 
Nucleic acids encoding wild-type human SR-BI protein are set forth in SEQ ID Nos. 1 and 
3 and are described, e.g., in Calvo and Vega (1993) J. Biol. Chem. 268:18929. 

Furthermore, depending on the site of the mutation in the SR-BI protein and the 

5 specific effect on its activity, specific treatments can be designed to compensate for that 
effect. The SR-BI protein is a cell surface receptor which binds specific forms of lipids, 
e.g., modified lipid or lipoproteins, e.g., HDL. Thus, an SR-BI protein has an extracellular 
domain which binds specific molecules, e.g., lipids, a transmembrane domain, and an 
intracellular domain, which is likely to transmit an intracellular signal. The structure of SR- 

10 BI proteins is ftirther described, e.g., in Calvo and Vega, supra; Acton et al. (1994) J. Biol. 
Chem. 269:21003; Acton et al. (1995) Science 271 :518; Rigotti et al. (1995) J. Biol. Chem. 
270:16221; Fukasawa et al. (1996) Exp. Cell. Res. 222:246; Wang et al. (1996) J. Biol. 
Chem. 271:21001; and published PCT Application having publication number WO 
96/00288 by Acton et al. Thus, if the mutation results in an SR-BI protein which is less 

1 5 capable of binding certain types of modified lipids, resulting in an accumulation of such 
lipids in the subject, a treatment can be designed which removes such modified lipids from 
the subject. In one embodiment, a compound which binds this form of lipid and is capable 
of targeting the lipid to a site where it is eliminated, is administered to the subject. 
Alternatively, the expression of another cell surface receptor which binds this type of lipid 

20 can be increased. In fact, both SR-BI and the class B scavenger receptor CD36 are capable 
of interacting with anionic phospholipids (Rigotti et al., supra). Thus, if a subject has a 
mutant SR-BI protein which is defective in its binding to anionic phospholipids, the subject 
can be treated by administration of a compound which increases CD36 protein levels in the 
cells. 

25 In situations in which the mutant SR-BI protein binds certain forms of lipids with 

higher affinity, and if this is causing or contributing to a disease, a subject having such a 
mutated SR-BI protein can be treated, e.g., by administration of compounds which inhibit 
or decrease the interaction between the specific form of the lipid and SR-BI. For example, 
soluble forms of SR-BI proteins or binding fragments thereof, can be administered to the 

30 subject. Alternatively, small molecules can be administered to the subject for interfering 
in the interaction between SR-BI and a lipid. 
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A mutant SR-BI protein can also be an SR-B1 protein having a mutation in the 
cytoplasmic domain of the protein which results in an aberrant signal transduction from the 
receptor. Subjects having such a mutation can be treated, e.g., by administration of 
compounds which induce the same or similar signal transduction or compounds which act 

5 downstream of the receptor. 

The effect of a mutation in an SR-BI protein can be determined according to 
methods known in the art. For example, if the mutation is located in the extracellular 
portion of the protein, one can perform binding assays with specific forms of lipids, e.g., 
HDL, and determine whether the binding affinity of such lipid with the mutated SR-BI 

10 protein is different from the binding affinity of the lipid with the wild-type protein. Such 
assays can be performed using a soluble form of an SR-BI protein or a membrane bound 
form of the protein. If the mutation in the SR-BI protein is located in the cytoplasmic 
domain of the protein, signal transduction experiments can be performed to determine 
whether the signal transduced from the mutated receptor differs from the signal transduced 

1 5 from the wild-type receptor. Alternatively, one can also investigate whether binding to a 
protein which interacts with the cytoplasmic domain of the receptor is affected by the 
mutation. Such determination can be made by, e.g., by immunoprecipitation. 

Yet in another embodiment, the invention provides methods for treating a subject 
having a mutated SR-BI gene, in which the mutation is located in a regulatory region of the 

20 gene. Such a regulatory region can be localized in the promoter of the gene, in the 5' or 3' 
untranslated region of an exon, or in an intron. A mutation in a regulatory region can result 
in increased production of SR-BI protein, decreased production of SR-BI protein, or 
production of SR-BI having an aberrant tissue distribution. The effect of a mutation in a 
regulatory region upon the SR-BI protein can be determined, e.g., by measuring the SR-BI 

25 protein level or mRNA level in cells having an SR-BI gene having this mutation and which, 
normally (i.e., in the absence of the mutation) produce SR-BI protein. The effect of a 
mutation can also be determined in vitro. For example, if the mutation is in the promoter, 
a reporter construct can be constructed which comprises the mutated promoter linked to a 
reporter gene, the construct transfected into cells, and comparison of the level of expression 

30 of the reporter gene under the control of the mutated promoter and under the control of a 
wild-type promoter. Such experiments can also be carried out in mice transgenic for the 
mutated promoter. If the mutation is located in an intron, the effect of the mutation can be 
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determined, e.g., by producing transgenic animals in which the mutated SR-BI gene has 
been introduced and in which the wild-type gene may have been knocked out. Comparison 
of the level of expression of SR-BI in the mice transgenic for the mutant human SR-BI gene 
with mice transgenic for a wild-type human SR-BI gene will reveal whether the mutation 

5 results in increased, decreased synthesis of the SR-BI protein and/or aberrant tissue 
distribution of SR-BI protein. Such analysis could also be performed in cultured cells, in 
which the human mutant SR-BI gene is introduced and, e.g., replaces the endogenous wild- 
type SR-BI gene in the cell. Thus, depending on the effect of the mutation in a regulatory 
region of an SR-BI gene, a specific treatment can be administered to a subject having such 

10 a mutation. Accordingly, if the mutation results in decreased production of an SR-BI 
protein, the subject can be treated by administration of a compound which increases 
synthesis, such as by increasing SR-BI gene expression, and wherein the compound acts at 
a regulatory element different from the one which is mutated. Alternatively, if the mutation 
results in increased SR-BI protein levels, the subject can be treated by administration of a 

15 compound which reduces SR-BI protein production, e.g., by reducing SR-BI gene 
expression or a compound which inhibits or reduces the activity of SR-BI. 

Furthermore, it is likely that subjects having different allelic variants of an SR-BI 
polymorphic region will respond differently to therapeutic drugs to treat diseases or 
conditions, such as those associated with an abnormal lipid level. Cholesterol-lowering 

20 drugs include lovastatin (MEVACOR; Merck & Co.), simvastatin (ZOCOR; Merck & Co.), 
dextrothyroxine (CHOLOXIN; Knoll Pharmaceutical Co.), pamaqueside (Pfizer), 
cholestryramine (QUESTRAN; Bristol-Myers Squibb), colestipol (COLESTID; Pharmacia 
& Upjohn), acipomox (Pharmacia & Upjohn), fenofibrate (LIPIDIL), gemfibrozil (LOPID; 
Warner-Lambert), cerivastatin (LIPOBAY; Bayer), fluvastatin (LESCOL; Novartis), 

25 atorvastatin (LIPITOR, Warner-Lambert), etofylline clofibrate (DUOLIP; Merckle 
(Germany)), probucol (LORELCO; Hoechst Marion Roussel), omacor (Pronova (Norway), 
etofibrate (Merz (Germany), clofibrate (ATROMID-S; Wyeth-Ayerst (AHP)), and niacin 
(numerous manufacturers). Drugs for treating obesity and/or gallstones include 
dexfenfluramine (REDUX, Interneuron Pharmaceuticals), megestrol acetate (MEGACE, 

30 Bristol-Myers Squibb), Phenylpropanolamine (ACUTRIM; Ciba; and DEXUTRIM; 
Thompson), fluoxetine (PROZAC, Lilly), dextroamphetamine (DEXEDRINE, SmithKline 
Beecham), fenfluramine and phentermine, chenodiol (CHENIX, Solvay), orlistat 
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(XENICAL, Roche), anandamide (Yissum (Israel)), PCM-4 (Omega Pharmaceutical), 
mono-octanoin (MOCTAN, Stokely-van Camp), sibutramine (MERIDIA, Knoll), 
testosterone (TESTODERM, Alza), oxandrolone (OXANDRIN, Bio-Technology General), 
ceruletide diethylamine (TYMTRAN, Pharmacia & Upjohn), testosterone and 

5 dihydrotestosterone (ANDROGEL and ANDROGEL-DHT, unimed), somatropin 
(SEROSTIM, Ares-Serono and BIO-TROPIN, Biotechnology General), and thalidomide 
(SYNOVIR, Celgene). 

A correlation between drug responses and specific alleles of SR-BI can be shown, 
for example, by clinical studies wherein the response to specific drugs of subjects having 

10 different allelic variants of a polymorphic region of an SR-BI gene is compared. Such 
studies can also be performed using animal models, such as mice having various alleles of 
human SR-BI genes and in which, e.g., the endogenous SR-BI has been inactivated such 
as by a knock-out mutation. Test drugs are then administered to the mice having different 
human SR-BI alleles and the response of the different mice to a specific compound is 

1 5 compared. Accordingly, the invention provides assays for identifying the drug which will 
be best suited for treating a specific disease or condition in a subject. For example, it will 
be possible to select drugs which will be devoid of toxicity, or have the lowest level of 
toxicity possible for treating a subject having a disease or condition. 

20 

Other uses for the nucleic acids of the invention 

The identification of different alleles of SR-BI can also be useful for identifying 
an individual among other individuals from the same species. For example, DNA 
sequences can be used as a fingerprint for detection of different individuals within the same 

25 species (Thompson, J. S. and Thompson, eds., Genetics in Medicine, WB Saunders Co., 
Philadelphia, PA (1991)). This is useful, e.g., in forensic studies. 

The present invention is further illustrated by the following examples which should 
not be construed as limiting in any way. The contents of all cited references (including 
literature references, issued patents, published patent applications as cited throughout this 

30 application are hereby expressly incorporated by reference. The practice of the present 
invention will employ, unless otherwise indicated, conventional techniques of cell biology, 
cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA, and 
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immunology, which are within the skill of the art. Such techniques are explained fully in 
the literature. See, for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by 
Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA 
Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait 

5 ed., 1984); Mullis et ah U.S. Patent No: 4,683,195; Nucleic Acid Hybridization (B. D. 
Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & S. J. 
Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); 
Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To 
Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic Press, Inc., 

10 N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 
1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu 
et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, 
eds., Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes 
I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold 

15 Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). 

5. EXAMPLE 

The invention now being generally described, it will be more readily understood 
by reference to the following examples which are included merely for purposes of 
20 illustration of certain aspects and embodiments of the present invention, and are not 
intended to limit the invention. 

Example 1: Isolation and sequence analysis of genomic DNA encoding the human 
SR-BI protein 

25 A probe consisting of a 474 base pair fragment of the human SR-BI cDNA was 

used to isolate bacterial artificial chromosomes (BACs) containing genomic DNA encoding 
the human SR-BI protein from a human BAC library (Research Genetics Inc. (Huntsville, 
AL) Cat.#96041 )). Two BACs were isolated by hybridizing the probe to this library. These 
BACs were then sized by pulse-field electrophoresis and the inserts were found to be 

30 approximately 80 and 70 kilobases long for BAC 179m 10 and BAC 256il9, respectively. 
All further discussion will focus on BAC 179ml0. 
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BAC 179m 10 was digested with restriction enzymes and analyzed by Southern blot 
hybridization with portions of human SR-BI cDNA, and shown to contain a large portion 
of the SR-BI sequence. This BAC was then sheared by nebulizing the DNA into fragments 
of approximately 1 -3 kb which were inserted into the pminisk vector and the resulting insert 

5 sizes ranged from 1-3 kb. Initially, clones which hybridized to the coding sequence of the 
full-length human SR-BI cDNA were sequenced, leading to the identification of most of the 
exons of the gene. Further random sequencing of the BAC sheared library led to the 
identification of the remaining coding exons and the adjacent intron flanking sequences. 
Sequence analysis of the genomic DNA indicated that the human SR-BI gene is at 

10 least 50 kb and contains 12 coding exons and one non-coding exon (exon 13, which 
contains the entire 3' untranslated region). The genomic structure of human SR-BI is shown 
in Figure 1. The nucleotide sequence of the exons and portions of the introns which are 
adjacent to the exons is shown in Figure 2. The coding region of the human SR-BI gene 
consists of 12 exons (see Table I in the Detailed Description). The location of introns 

15 relative to the nucleotide sequence of a cDNA encoding human SR-BI is shown in Figure 
2 and Figure 3 and indicated in Table II of the Detailed Description. The portions of the 
protein encoded by each of the exons is also shown in Figure 3 and in Table II. 

A number of the introns are extremely large (> 10 kb) (see Table I in the Detailed 
Description). The intron/exon boundaries were remarkably similar to those found in the 

20 human CD36 gene, which is a member of the same protein family as SR-BI (Tang et al. 
(1994) J. Biol. Chem. 269:601 1). 



Example 2: Identification of primer pairs to isolate intronic. exonic* and promoter 
25 sequences for detection of polymorphisms and mutations 

Multiple pairs of primers were synthesized in order to amplify each of the exons 
or portions thereof and adjacent intronic regions. Genomic DNA from a human subject was 
subjected to PCR in 25 \i\ reactions (lx PCR Amplitaq polymerase buffer, 0. 1 mM dNTPs, 
0.8 \iM 5' primer, 0.8 \iM 3' primer, 0.75 units of Amplitaq polymerase, 50 ng genomic 
30 DNA) using each of the above described pairs of primers under the following cycle 
conditions: 94°C for 2 min, 35 x [94°C for 40 sec, annealing temp for 30 sec, 72°C for 1 
min], 72°C for 5 min, 4°C hold. The resulting PCR products were analyzed on a 2% agarose 
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10 



15 



gel. The identity of the PCR product was confirmed by digestion with a restriction enzyme 
and subsequent agarose electrophoresis. Twelve pairs of oligomers were chosen to serve 
as PCR primers to amplify regions containing each of the 12 coding exons of the human 
SR-BI gene and one pair of primers was chosen to serve as PCR primers to amplify a 
promoter region. The nucleotide sequence of these primers in indicated in Table VI and 
nucleotide sequences to which these primers bind are shown in Figure 2. The optimum 
PCR annealing temperature for each primer pair as well as the expected sizes of the PCR 
products and diagnostic restriction sites is set forth below in Table V. Table V also 
indicates the size of DNA fragments obtained when digesting the amplified fragment with 
the restriction enzyme indicated in the table. A PCR reaction using primers having SEQ 
ID NO: 41 and 42 for amplifying exon 1 is preferably carried out in the presence of 10% 
DMSO. 

Table IV 

fiifin primer name SEP ID NO: Nucleotide Sequence (5' -> 3') 

CCCCTGCCGCCGGAATCCTGAAG 



5el6srbl 
3el6srbl 



41 
42 



CGCTTTGGCGGAGCAGCCCATGTC 



20 



5e22srbl 
3e22srbl 

5e30srbl 
3e30srbl 



43 
44 

45 
46 



TGGGGCCCTCATCACTCTCCTCAC 
GCAGCCTCCCCATCCCGTCCACT 

ATTGCAGGCGAGTAGAAG 
CAGGCGGGAGGAGAGACA 



25 



5e41srbl 
3e4Isrbl 



47 
48 



TGGGCTCTTTGCTGTGAGGC 
CCAGGCTGTGTGAGGGGAAG 



5e50srbl 
3e50srbl 



49 
50 



GCCCAGAATGTTCAGACCAG 
GCACCCTCTTCACGACAAAG 



30 



5e60srbl 
3e60srbl 



51 
52 



CACCTGAGAGGGCTTATTA 
CAAAATGCTTTCCAAGTGC 



35 



5e71srbl 
3e71srbl 



53 
54 



GCCGCCGGGTCTGGGTGTCC 
CAGAGGCCAGAGATTAAGCAGAC 



5e81srbl 



55 



TTGTATGATGTCCCCTCCCT 
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3e81srbl 

9 5e9lsrbl 
3e9lsrbl 

10 5el04srbl 
3el04srbl 

11 5ell2srbl 
3ell2srbl 

12 5el00srbl 
3el00srbl 

Prom. 5pl3srbl 
5pl3srbl 



Exon primer pairs 

1 SEQIDNO:41 
SEQ ID NO: 42 

2 SEQ ID NO: 43 
SEQ ID NO: 44 

3 SEQ ID NO: 45 
SEQ ID NO: 46 

4 SEQ ID NO: 47 
SEQ ID NO: 48 

5 SEQ ID NO: 49 
SEQ ID NO: 50 

6 SEQ ID NO: 51 
SEQ ID NO: 52 

7 SEQ ID NO: 53 
SEQ ID NO: 54 

8 SEQ ID NO: 55 
SEQ ID NO: 56 
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56 TTCCCACCACCCCAGCCCAC 

57 GGTTGACTGTGTCCCTGGAG 

58 GGGAACACTGGAGCACTGAGC 

59 GGTGGTGAGGGTTTAGTGTG 

60 CTCCCCCCGCCTCCTGCCTC 

61 AAGGTGTTGGGTGGCATCTG 

62 GGCTCCAGGCTGCGGTTGGC 

63 TTGAAGAACCGTGTAAAAC 

64 TTGAGGCTGAAGGAATGA 

83 TCCTGGGTGGGCTGGCGAAGTC 

84 GTTTTGGGGCGGGAGCTGATGAAG 

Table V 

Temp. Product length Enzvme check 

65°C 162 bp BamHI(144, 118) 

64°C 294 bp Apal(189,98,7) 

57°C 281 bp XhoI(153, 128) 

59°C 360 bp Spel (292, 68) 

57°C 291 bp BamHI(157, 134) 

52°C 273 bp DraII(179,72,22) 

59°C 290 bp EcoRI(184, 106) 

58°C 261 bp Haelll (158,103) 
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9 SEQ ID NO: 57 57°C 
SEQ ID NO: 58 

10 SEQ ID NO: 59 56°C 
SEQ ID NO: 60 

11 SEQ ID NO: 61 60°C 
SEQ ID NO: 62 

12 SEQ ID NO: 63 51°C 
SEQ ID NO: 64 

prom. SEQ ID NO: 83 63°C 
SEQ ID NO: 84 

Example 3: Detection of polymorphic regions in the human SR-BI gene bv SSCP 

Genomic DNA from a population of 389 unrelated Caucasian men and women, 
chosen because they had a known HDL and LDL level (high, normal, or low), known body 
mass index, known level of triglycerides, and known age (see Table VI) was analyzed as 
described below. 

Table VI 



Anthropometric and plasma lipid concentrati ons of the population studied 





Men (n=101) 


Women (n=288) 


Age (years) 


40 ±16 


36 ± 12 


BMI (kg/m 2 ) 


25.2 ±3.3 


22.8 ±3.6 


TC(mg/dL) . 


227 ± 57 


198 ±45 


LDL-C (mg/dL) 


158 ±49 


122 ±39 


HDL-C (mg/dL) 


45 ±23 


63 ±17 


TG (mg/dL) 


120 ±64 


68 ±34 



Genomic DNA from each of these individuals was subjected to PCR in 25 pi 
reactions (IX PCR Amplitaq polymerase buffer, 0.1 mM dNTPs, 0.8 uM 5' primer, 0.8 uM 
3' primer, 0.75 units of Amplitaq polymerase, 50 ng genomic DNA) using each of the above 
described pairs of primers under the following cycle conditions: 94°C for 2 min, 35 x [94°C 
for 40 sec, annealing temp for 30 sec, 72°C for 1 min], 72°C 5 min, 4°C hold. The 



206 bp PstI (107,99) 

253 bp Avail (148,105) 

327 bp Ncol (242, 85) 

303 bp PstI (184,1 19) 

247 bp BstXI (200, 47) 
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optimum PCR annealing temperatures for each set of primers are given in Table V. The 
expected sizes of the PCR products, as well as diagnostic restriction sites, are also indicated 
in Table V. 

The amplified genomic DNA fragments were then analyzed by SSCP (Orita et al. 

5 (1989) PNAS USA 86:2766, see also Cotton (1993) Mutat Res 285:125-144; and Hayashi 
(1992) Genet Anal Tech Appl 9:73-79). From each 25 |il PCR reaction, 3 jil was taken and 
added to 7 \xl of loading buffer. The mixture was heated to 94°C for 5 min and then 
immediately cooled in a slurry of ice- water. 3-4 \il were then loaded on a 10% 
polyacrylamide gel containing 10% glycerol and then subjected to electrophoresis either 

10 overnight at 4 Watts at room temperature, overnight at 4 Watts at 4°C (for amplifying a 
promoter region), or for 5 hours at 20 Watts at 4°C (for amplifying exons 3 and 4). The 
secondary structure of single-stranded nucleic acids varies according to sequence, thus 
allowing the detection of small differences in nucleic acid sequence between similar nucleic 
acids. At the end of the electrophoretic period, the DNA was analyzed by gently overlaying 

1 5 a mixture of dyes onto the gel ( 1 x the manufacturer' s recommended concentration of S YBR 
Green I and SYBR Green II in 0.5 X TBE buffer (Molecular Probes)) for 5 min, followed 
by rinsing in distilled water and detection in a Fluoroimager 575 (Molecular Dynamics). 
Polymorphisms were found in or near exons 1, 3, 5, 8, and 10. 

20 Example 4: Identification of polymorphic regions in the human SR-BI gene bv direct 
sequencing of PCR products 
To determine the sequences of the polymorphisms identified, the regions containing 
the polymorphisms were reamplified using the aforementioned primers which were 
modified to contain additional sequence which could be used to directly sequence the PCR 

25 product (Ml 3 forward sequence for 5' primer and +M13 reverse sequence for 3' primer) on 
the 5* end of the primers as listed in Table IV. In particular, the forward primers (5' end 
primers) contained the nucleotide sequence "TGTAAAACGACGGCCAGT" (SEQ ID NO: 
85) located 5' of the nucleotide sequences shown in Table IV and the reverse primer (3' end 
primer) contained the nucleotide sequence "CAGGAAACAGCTATGACC" (SEQ ID NO: 

30 86) located 5' of the nucleotide sequence shown in Table III. The genomic DNA from the 
subjects was subjected to PCR in 50 |il reactions (Ix PCR Amplitaq polymerase buffer, 0.1 
mM dNTPs, 0.8 nM 5' primer, 0.8 ^M 3' primer, 0.75 units of Amplitaq polymerase, 50 ng 
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genomic DNA) using each of the above described pairs of primers under the following cycle 
conditions: 94°C for 2 min, 35 x [94°C for 40 sec, annealing temp for 30 sec, 72°C for 1 
min], 72°C 5 min, 4°C hold. The optimum PCR annealing temperatures for each of the 
primer pairs are given in Table V. The newly amplified products were then purified by 
5 agarose gel electrophoresis and subjected to sequencing using Ml 3 forward and reverse 
primers. 

The results indicate that the polymorphism in the region of exon 1 is a change from 
a guanine at nucleotide 146 to an adenine, resulting in a change of the second amino acid 
of the protein from a glycine to a serine. The polymorphism in the region of exon 3 is a 

10 change from the guanine at position to 1 19 to an adenine, resulting in a change of amino 
acid 135 of the protein from a valine to isoleucine. The polymorphism in the region of exon 
8 was determined to constitute a change in base position 41 of exon 8, from a cytidine to 
a thymidine. This substitution does not result in a change in amino acid. In a subpopulation 
of 142 individuals, about 35% of these individuals were homozygous for an allele having 

15 a cytidine at position 41 of exon 8; about 17% of these individuals were homozygous for 
an allele having a thymidine at this position; and about 48% of these individuals were 
heterozygous, having one allele of each type. The polymorphism in the region of exon 5 
is a change in nucleotide 54 of intron 5 (nucleotide 1 being the first nucleotide of the 
intron), from a cytidine to a thymidine. In a subpopulation of 142 individuals, about 24% 

20 of individuals have a thymidine at position 54 of intron 5. The polymorphism in the region 
of exon 1 1 is a change from the cytidine at position -41 (nucleotide -1 corrresponds to the 
most 3* nucleotide of intron 10) of intron 10 to a guanine. The polymorphisms are indicated 
in Table III and in Figure 2. 

25 Example 5: Association o f common polymorphisms at the SR-BI gene with plasma 
lipids and anthropometric parameters 

After identification of the mutations in exons 1, 3, and 8 and introns 5 and 10, 
subjects were typed by digestion of PCR products using the primers and enzymes listed in 
Table VI. 

30 Table VII 

Primers and enzv mes for typing allelic variants 
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poly- 
morphism 


primers 


temp 


digest 


product sizes 


exon 1 
(G/A) 


CCGGCGATGGGGCATAAAACCACT 
CGCCCAGCACAGCGCACAGTAGC 


68-62C 


Alul 


GG: 263 

GA: 263, 192.71 
AA: 192.71 


intron 5 
(C/T) 


GCCCAGAATGTTCAGACCAG 
GCACCCTCTTCACGACAAAG 


57C 


Apal 


CC: 194. 67. 30 
CT: 194,97, 67, 30 
TT: 194.97 


exon 8 
(C/T) 


CCTTGTTTCTCTCCCATCCTCACTTCCTCAAGGC 
CACCACCCCAGCCCACAGCAGC 


66-6 1 C 


Haelll 


CC: 154,33,31 
CT: 154. 64. 33,31 
TT: 154. 64 



10 Plasma lipids were measured after a 12 to 14-hour overnight fast, using blood 

collected in tubes containing 0.1% EDTA. Plasma HDL-cholesterol was measured after 
precipitation of plasma apoB-containing lipoproteins as previously described. Plasma total 
cholesterol, HDL cholesterol and triglyceride levels were measured as previously described. 
LDL cholesterol was calculated by the Friedewald equation when triglyceride levels were 

1 5 below 400 mg/dL. Coefficients of variation between runs for all lipid assays were less than 
5%. 

The SPSS statistical package PC version 7.5.1 was used for the statistical analysis. 
Because of the differences observed between men and women for several of the 
anthropometric and lipid variables, all the statistical analyses were carried out separately 

20 by gender. Triglycerides were log transformed for analysis. For each of the variables 
examined, the significance of the differences between alleles or genotypes was estimated 
by analysis of covariance (ANCOVA) using the General Linear Model (GLM) procedure 
from SPSS, with age as covariate and the Tukey ' s post hoc test for multiple comparisons 
for observed means. For analysis related to HDL-C, smoking and alcohol intake were also 

25 included as covariates. Means and standard deviations for all variables were calculated for 
each genotype group and the significance level was established at p<0.05. The allele and 
haplotype frequencies were estimated using the EH linkage utility program (Terwilliger and 
Ott J. (1994) Handbook for human ge netic linkag e Johns Hopkins University Press, 
Baltimore, MD). 

30 The frequencies of the less common allele for each of the polymorphisms described 

at the SR-BI gene locus were as follows: exon 1 : 0.1 136; exon 3: 0.0184; intron 5: 0.1002; 
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exon 8: 0.4389 and intron 11: 0.0425. The associations between these common 
polymorphisms and plasma lipid concentrations and BMI are presented in table VII for men 
and table VIII for women. For men, no significant associations were observed between any 
of the variables examined and the common polymorphisms at exon 1, intron 5 and exon 8. 

5 Conversely, in women, the less common allele defined by the polymorphism at exon 8 was 
associated with significantly lower mean plasma LDL cholesterol concentrations (11 8+38 
and 1 16+36 mg/dL for heterozygotes and homozygotes, respectively) than those observed 
in subjects homozygous for the most common allele (131+42 mg/dL; p=0.043). No other 
significant associations were observed between these polymorphisms and other lipid 

10 variables. In women, a significant association was observed between the intron 5 
polymorphism and BMI. Women carriers of the less common allele showed a mean BMI 
value (23.8+3.8) that was significantly greater (p=0.031) than those women homozygous 
for the most common allele (22.4+3.4). 

Table VIII 

15 Anthropometric characteristics and plasma lipid concentration of the population studied 



according to SR-BI genotypes (Meiri 





Exon 1 


Intron 5 


Exon 8 




l/i 

(n=71) 


1/2 (n=8) 


P 

value 


l/i 

(n=74) 


1/2 

(n=19) 


P 

value 


l/l 

(n=22) 


1/2 

(n=49) 


2/2 

(n=18) 


P 

value 


Age 
(years) 


41 + 14 


38+16 


0.659 


40+15 


46+13 


0.114 


47+14 


42 + 14 


37 + 13 


0.068 


BMI 
(kg/rrr) 


25.7 + 
3.0 


25.6 + 
1.9 


0.883 


25.3 + 
2.6 


26.1 + 
3.8 


0.326 


25.7 + 
3.6 


25.8 + 
2.5 


25.0 + 
3.2 


0.583 


TC 

(mg/dL) 


228 + 46 


221+38 


0.712 


225 + 55 


240 + 41 


0.282 


234 + 40 


221+51 


225 +47 


0.559 


LDL-C 
(mg/dL) 


159 + 46 


157 + 47 


0.908 


156 + 5! 


175 + 48 


0.145 


168 + 44 


154 + 52 


158 + 47 


0.520 


HDL-C 
(mg/dL) 


43 + 22 


43+14 


0.983 


43 + 24 


45+19 


0.733 


39+17 


43 + 25 


44+16 


0.761 


TG 

(mg/dL) 


127 + 65 


105+43 


0.346 


129 + 68 


98 + 27 


0.052 


133 + 63 


122 + 60 


119 + 69 


0.715 



Table IX 
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Anthropometric characteristics and plasma lipid concentration of the population studied 

according to SR-BI genotypes (Women) 





Exon 1 


Intron 5 


Exon 8 




i/t 

(n=18l) 


1/2 

(n=66) 


2/2 
(n=l) 


P 

value 


l/l 

(n=74) 


1/2 

(n=19) 


P 

value 


l/l 

(n-73) 


1/2 

(n=148) 


2/2 

(n=37) 


P 

value 


Age 
(years) 


35+12 


38+12 


40 


0.218 


36 + 
12 


38 + 
12 


0.333 


37+12 


36+12 


34 + 
12 


0.458 


BMI 

(kg/rrr) 


22.9 + 
3.9 


22.5 + 
3.0 


19.7 


0.552 


22.4 + 
3.4 


23.8 + 
3.8 


0.031 


22.8 + 
3.0 


23.0 + 
3.9 


21.9 + 
3.4 


0.252 


TC 

(mg/dL) 


197 + 

45 


198 + 42 


211 


0.932 


198 + 
46 


204 + 
44 


0.446 


206 + 
49 


196 + 42 


192 + 
42 


0.215 


LDL-C 
(mg/dL) 


121 + 

39 


120 + 38 


134 


0.928 


122 + 
39 


125 + 
42 


0.711 


131 + 
42 


118 + 38 


116 + 
36 


0.043 


HDL-C 
(mg/dL) 


63+17 


64+19 


63 


0.930 


62 + 
16 


64 + 
21 


0.145 


61 + 16 


64+17 


64 + 
22 


0.500 


TG 

(mg/dL) 


64 + 28 


71 +37 


72 


0.317 


68 + 
35 


64 + 
30 


0.423 


67 + 35 


70 + 38 


57 + 
22 


0.146 



Haplotype analyses (using the method described by Terwillinger and Ott, 1994, supra) 
using the three most common polymorphisms identified at this locus (exon 1 , intron 5 and exon 

20 8) were carried out. Six possible haplotypes were identified according to the absence (1) or 
presence (2) of the variant allele at each one of the three polymorphic sites. Four of the haplotypes 
were common: 1 1 1 (wild type); 1 12 (exon 8 variant); 121 (intron 5 variant); 21 1 (exon 1 variant); 
whereas two of the estimated haplotypes were rare: 221 (variants at exon 1 and intron 5) and 212 
(variants at exons 1 and 8). The most common haplotype, 111, had the highest apparent 

25 frequency in women (44.6%) and therefore was considered to be wild-type. Each subject was 
assigned to the most plausible genotype; however, because of the uncertainty associated with 
genotype assignments in double heterozygotes when studying unrelated subjects on whom the 
phase of the polymorphisms cannot be directly ascertained, we used in further analysis only those 
subjects with unequivocal genotypes. In women, the 111/112 and 112/112 genotypes were found 

30 to be associated with lower LDL-C levels as compared with the wild genotype (111/111); whereas 
the 1 1 1/121 genotype was associated with increased LDL-C levels as compared with the 1 1 1/1 12 
and 1 1 1/21 1 genotypes (Figure 4). In men the trends were similar to those observed in women; 
however, the differences did not reach statistical significance (Figure 5). In terms of HDL-C 
levels, all the alleles carrying mutations were associated with increased HDL-C levels in men 

35 (Figure 6); however, no significant effects were observed in women (Figure 7). Thus, as 
compared to the wild-type haplotype, the analyses revealed that men with haplotypes 1 12, 121, 
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and 21 1 tended to have significantly higher HDL. In fact, those subjects with haplotype 21 1 (one 
wild-type chromosome and one with a polymorphism in exon 1) had an average of 75% higher 
HDL levels than individuals containing only wild-type chromosomes. Such associations were 
not observed in women. Without wanting to be limited by a specific mechanism of action, it has 

5 been shown that, though less efficient than the LDL receptor, SR-BI is able to mediate the 
degradation of LDL in vitro. SR-BI may also play an indirect role in LDL cholesterol metabolism 
by altering cholesterol homeostasis in the individual. 

The previous association between the intron 5 polymorphism and BMI observed in 
women using single marker analysis was stronger using haplotype analysis. Subjects carrying 

lOthe 121 haplotype (n=12), had a mean BMI (25.0"3.3) significantly greater (p<0.05) than that 
observed in subjects homozygous for the wild haplotype 1 1 1 (n=29; 22.8"3.2) and those carriers 
of the 112 (n-78; 22.8 ,f 3.9) and 211 (n=12; 21.5 ,, 2.1) haplotypes (Figure 8). However, no 
association was observed in men (Figure 9). 

There was also a significant association between the intron 5 polymorphism and BMI in 

1 5 women. The latter finding was most significant in premenopausal women. The 288 female 
subjects in this study were not considered to be obese. Thus, this effect was observed on 
individuals within the normal weight range. There are only a few polymorphisms known to date 
that are associated with BMI values in humans (Bouchard and Perusse (1996) Obesity Research 
4:81-90), and thus this is a significant finding. Increased body-mass index has been associated 

20 with higher mortality from all causes and from cardiovascular disease. For mortality from 
cardiovascular disease, the relative risk associated with an increment of one in the body-mass 
index in women in the age range of 30-to-44 year old has been reported to be 1.08 (95 percent 
confidence interval, 1.05 to 1.11) (Stevens et al. (1998) N Engl J Med 338:1-7; Colditz et al. 
(1995) Ann Intern Med 122(7):48 1-486). The data presented herein show that for an average 

25 woman, the presence of the 121 haplotype raises the BMI by approximately 2.2 kg/m2 which 
corresponds to about 6 kgs. This increase in BMI could result in an increase in CHD mortality 
of about 1 7.6%, primarily due to a greater risk of developing non-insulin dependent diabetes 
mellitus (Stevens et al. (1998) supra', Colditz et al. (1995) supra), a major risk for coronary artery 
atherosclerosis. An intriguing observation is the fact that no subjects were found to be 

30 homozygous for this polymorphism in this sample of 389 randomly selected individuals, despite 
the expected frequency of 1% (~4 individuals) assuming Hardy- Weinberg equilibrium. 

All of the above-cited references and publications are hereby incorporated by reference. 
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CLAIMS 



1 . A method for determining whether a female subject has, or is at risk of developing, a 
body mass disorder, comprising determining the identity of the allelic variant of a 

5 polymorphic region of the SR-BI gene of the subject and comparing the allelic variant 

of the subject with allelic variants associated with body mass disorders, to thereby 
determine whether the subject has an allelic variant of a polymorphic region of an SR- 
BI gene associated with a body mass disorder. ^ 

10 

2. The method of claim 1 , wherein the body mass disorder is an abnormally high body 
mass. 

3. A method of claim 2, wherein the polymorphic region is located in an intron. 

15 

4. A method of claim 3, wherein the intron is intron 5. 

5. A method of claim 4, wherein the polymorphic region is a nucleotide polymorphism. 

20 6. A method of claim 5, wherein the nucleotide polymorphism is located at position 54 
of intron 5. 



7. A method of claim 6, wherein nucleotide 54 of intron 5 of the SR-BI gene in a 

normal subject is a cytidine and the presence of a nucleotide other than a cytidine at 
25 position 54 on intron 5 in the SR-BI gene of a subject indicates that the subject has 

or is at risk of developing an abnormally high body mass. 



8. 

30 



A method of claim 7, wherein the nucleotide other than a cytidine at position 54 of 
intron 5 is a thymidine. 
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9. A method of claim 1 , wherein determining the identity of the allelic variant of a 

polymorphic region of an SR-BI gene comprises determining the identity of at least 
one nucleotide of the polymorphic region. 

5 10. A method of claim 2, wherein determining the identity of the allelic variant of a 

polymorphic region comprises contacting a nucleic acid of the subject with at least 
one probe or primer which is capable of hybridizing to an SR-BI gene. 

11. A method of claim 10, wherein the probe or primer is capable of specifically 
10 hybridizing to an allelic variant of the polymorphic region. 

12. A method of claim 11, wherein the probe or primer is capable of specifically 
hybridizing to an allelic variant having a thymidine at position 54 of intron 5 of the 
SR-BI gene. 

15 

13. A method of claim 1 , wherein the probe or primer has a nucleotide sequence from 
about 15 to about 30 nucleotides. 

14. A method of claim 1, wherein the probe or primer is a single stranded nucleic acid. 

20 

15. A method of claim 1 , wherein the probe or primer is labeled. 

16. A method of claim 1, wherein determining the identity of the allelic variant of a 
polymorphic region is carried out by allele specific hybridization. 

25 

17. A method of claim 1, wherein determining the identity of the allelic variant of a 
polymorphic region is carried out by primer specific extension. 



18. 

30 



A method of claim K wherein determining the identity of the allelic variant of a 
polymorphic region is carried out by an oligonucleotide ligation assay. 
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19. A method of claim 1 , wherein determining the identity of the allelic variant of a 
polymorphic region comprises performing a restriction enzyme site analysis. 

20. A method of claim 1 9, wherein the restriction enzyme is an Apal enzyme. 

5 

21 . A method of claim 1, wherein determining the identity of the allelic variant of a 
polymorphic region is carried out by single-stranded conformation polymorphism. 

22. A kit for determining whether a subject has, or is at risk of developing, a body mass 
10 disorder, comprising a probe or primer which is capable of hybridizing to an SR-BI 

gene and thereby identifying whether the SR-BI gene contains an allelic variant of a 
polymorphic region which is associated with the body mass disorder and instructions 
for use in diagnosing a subject as having, or having a predisposition, towards 
developing a body mass disorder. 

15 

23. The kit of claim 22, wherein the body mass disorder is an abnormally high body 
mass. 

24. A kit of claim 22, wherein the polymorphic region is a nucleotide polymorphism. 

20 

25. A kit of claim 22, wherein the nucleotide polymorphism is located in an intron. 

26. A kit of claim 25, wherein the intron is intron 5. 

25 27. A kit of claim 26, wherein the polynucleotide polymorphism is located at nucleotide 
54 of intron 5. 

28. A kit of claim 27, wherein nucleotide 54 of intron 5 is a substitution of a cytidine 
with a thymidine. 

30 

29. A kit of claim 22, wherein the probe or primer is capable of hybridizing to the 
polymorphic region of the SR-BI gene. 
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30. A kit of claim 29, wherein the probe or primer is capable of hybridizing to a region 
of the human SR-BI comprising nucleotide 54 of intron 5. 

5 31. A kit of claim 30, wherein the probe or primer is capable of hybridizing specifically 
to a region of the human SR-BI gene comprising a thymidine at position 54 of intron 
5. 

32. A kit of claim 22, wherein the primer hybridizes to a region of an SR-BI gene 
1 0 adjacent to the polymorphic region. 

33 . A kit of claim 22, further comprising a second primer. 

34. A kit of claim 22, wherein the probe or primer has a nucleotide sequence from about 
15 1 5 to about 3 0 nucleotides. 

35. A kit of claim 22, wherein the probe or primer is a single stranded nucleic acid. 

36. A method for determining whether a female subject has, or is at risk of developing, 
20 an abnormally high LDL level, comprising determining the identity of the allelic 

variant of a polymorphic region of the SR-BI gene of the subject and comparing the 
allelic variant of the subject with allelic variants associated with high LDL levels, to 
thereby determine whether the subject has or is at risk of developing an abnormally 
high LDL level. 

25 

37. A method of claim 36, wherein the polymorphic region is located in an exon. 

38. A method of claim 37, wherein the exon is exon 8. 

30 39. A method of claim 38, wherein the polymorphic region is a nucleotide 
polymorphism located at nucleotide 41 of exon 8. 
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40. A method of claim 39, wherein the presence of a cytidine at nucleotide 41 of exon 8 
of the SR-BI gene is indicative that the subject has or is at risk of developing an 
abnormally high LDL level. 

5 41. A method of claim 36, wherein the polymorphic region is located in an intron. 

42. A method of claim 41 , wherein the intron is intron 5. 

43. A method of claim 42, wherein the polymorphic region is a nucleotide 
1 0 polymorphism located at nucleotide 54 of intron 5. 

44. A method of claim 43, wherein the presence of a thymidine at nucleotide 54 of 
intron 5 of the SR-BI gene is indicative that the subject has or is at risk of 
developing an abnormally LDL level. 

15 

45. A method for determining whether a male subject has, or is at risk of developing, an 
abnormally low HDL level, comprising determining the identity of the allelic variant 
of a polymorphic region of the SR-BI gene of the subject and comparing the allelic 
variant of the subject with allelic variants associated with low HDL levels, to thereby 

20 determine whether the subject has or is at risk of developing an abnormally low 

HDL level. 

46. A method of claim 36, comprising determining the identity of the nucleotide at 
position 146 in exon 1, nuclotide 41 in exon 8 and/or nucleotide 54 in intron 5, 

25 wherein the presence of a guanine at position 146 of exon 1, the presence of a 

cytidine at position 41 of exon 8 and/or the presence of a cytidine at position 54 of 
intron 5 indicates that the subject has or is at risk of developing an abnormally low 
LDL level. 



30 47. 



A kit for determining whether a subject has, or is at risk of developing, a high LDL 
level, comprising a probe or primer which is capable of hybridizing to an SR-BI 
gene and thereby identifying whether the SR-BI gene contains an allelic variant of a 
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polymorphic region which is associated with a high LDL level and instructions for 
use in diagnosing a subject as having, or having a predisposition, towards developing 
a high LDL level. 

5 48. A kit of claim 47, wherein the polymorphic region is located in an exon. 

49. A kit of claim 48, wherein the exon is exon 8. 

50. A kit of claim 49, wherein the polymorphic region is a nucleotide polymorphism 
10 located at nucleotide 41 of exon 8. 

51. A kit of claim 50, wherein the presence of a cytidine at nucleotide 4 1 of exon 8 of 
the SR-BI gene is indicative that the subject has or is at risk of developing an 
abnormally high LDL level. 

15 

52. A kit of claim 47, wherein the polymorphic region is located in an intron. 

53. A kit of claim 52, wherein the intron is intron 5. 

20 54. A kit of claim 53, wherein the polymorphic region is a nucleotide polymorphism 
located at nucleotide 54 of intron 5. 

55. A kit of claim 54, wherein the presence of a thymidine at nucleotide 54 of intron 5 of 
the SR-BI gene is indicative that the subject has or is at risk of developing an 

25 abnormally LDL level. 

56. A kit for determining whether a male subject has, or is at risk of developing, an 
abnormally low HDL level, comprising a probe or primer which is capable of 
hybridizing to an SR-BI gene and thereby identifying whether the SR-BI gene 

30 contains an allelic variant of a polymorphic region which is associated with a low 

HDL level and instructions for use in diagnosing a subject as having, or having a 
predisposition, towards developing a low HDL level. 
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57. A kit of claim 56, wherein the polymorphic region is a nucleotide selected from the 
group consistin of nucleotide 146 in exon 1, nuclotide 41 in exon 8 and nucleotide 
54 in intron 5, wherein the presence of a guanine at position 146 of exon 1, the 
5 presence of a cytidine at position 41 of exon 8 and/or the presence of a cytidine at 

position 54 of intron 5 indicates that the subject has or is at risk of developing an 
abnormally low LDL level. 



10 
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promoter and exon 1 

ACTGCGGAGATGAGGGTCTAGAAGGTGGTGGCGGGGCAT 
GTGGACCGTTGTAAGGGCTCTGGGGTTCCTGGGTGGGCT 
GGCGAAGTCCTACTCACAGTGACCAACCATGATGATGGT 
CCCGATAGAGGAGGAGAGGGAGGAGGAGGGAAAAGGAAG 
GGTGAGGGGCTCAGAGGGGAGAGCTGGGAGGAGGGGAGA 
CATAGGTGGGGGAAGGGGTAGGAGAAAGGGGAAGGGAGC 
AAGAGGGTGAGGGGCACCAGGCCCCATAGACGTTTTGGC 
TCAGCGGCCACGAGGCTTCATCAGCTCCCGCCCGAAAAC 
GGAAGCGAGGCCGTGGGGGCAGCGGCAGCATGGCGGGGC 
TTGTCTTGGCGGCCATGGCCCCGCCCCCTGCCCGTCCGA 
TCAGCGCCCCGCCCCGTCCCCGCCCCGACCCCGCCCCGG 
GCCCGCTCAGGCCCCGCCCCTGCCGCCGGAATCCTGAAG 
CCCA AGGCTGCC CGGGGGCGGTCCGGCGGCGCCGGCGAT 
GGGG dATAAAAj CCACTGGCCACCTGCCGGGCTGCTCC 

TGCGTGCGCTGCCGTCCCGGA TCCACCGTGCCTCTGCGG 
CCTGCGTGCCCGGAGTCCCCGCCTGTGTCGTCTCTGTCG 
CCGTCCCCGTCTCCTGCCAGGCGCGGAGCCCTGCGAGCC 
GCGGGTGGGCCCCAGGCGCGCAGACATGAGCTGCTCCGC 
CAAAGCGCGCTGGGCTGCCGGGGCGCTGGGCGTCGCGGG 
GCTACTGTGCGCTGTGCTGGGCGCTGTCATGATCGTGAT 
GGTGCCGTCGCTCATCAAGCAGCAGGTCCTTAAG 

GTGGGTGAGGGAGAC CC C AGGGGGT C CGCGCACGGAC CC 
GGGCTGTTGGGCGCTGGGCGCCGGGAGGACCCGCGCGTT 
GCGGTGGGTGGGCGACCGCAGCGGAATCGGCGCCCGGGC 
CTGGCGCCGCAGAACACGAGGGAGGCCAGGCGCTTCGGG 
AGGGGCTGCTGCCCGCCTCCCCACCACCCTCACC 

Fig. 2A 
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exon 2 

AGCCTCATGTGCGAAGGGCTTTCCCACCACCTCCTATCC 

CAAGCTCCCGCCGAGGAGCCCCTTCCCTGGCCGGGCTCG 

GGCAGCTGTTCCGGAGCCTTGTGGTGGGGCGTGGGGCC 

CTCATCACTCTCCTCACAAGCGTACTTGTCCCTTCCC 

CTGCAG 

AACGTGCGCATCGACCCCAGTAGCCTGTCCTTCAACATG 
TGGAAGGAGATCCCTATCCCCTTCTATCTCTCCGTCTAC 
TTCTTTGACGTCATGAACCCCAGCGAGATCCTGAAGGGC 
GAGAAGCCGCAGGTGCGGGAGCGCGGGCCCTACGTGTAC 
AG 

GTGAGGCTGTGTCCACGTGATGGTGGACGGGCCGGCTGA 

CGCTGGGCATGGGACGGGTCTCANAGTGGACGGGATG 

GGGAGGCTGCTGACTGACCCCCAAACATTGTTCCGGAA 

GCACGCAACTCATAGTCGGGGTAAGTGCTACTCCCAAAA 

AAGTTTGCGT 



exon 3 

CATGTCCTGCAGTGGGCAGGCAGCGGGAGGGACAGACTT 

GGCGAAGGGGCCGAGCTCAGCTTTGGCTGTGGGGCCGGA 

GGTGTGCACAGACGTCCAGGGCCCCTGGTTCCCAGGCAG 

GCATTGCAGGCGAGTAGAAGGGAAACGTCCCATGCAG 

CGGGGCGGGGCGTCTGACCCACTGGCTTCCCCCACAG 

GGAGTTCAGGCACAAAAGCAACATCACCTTCAACAACAA 
CGACACCGTGTCCTTCCTcGAGTACCGCACCTTCCAGTT 
CCAGCCCTCCAAGTCCCACGGCTCGGAGAGCGACTACAT 
CATCATGCCCAACATCCTGGTCTTG 

GTGAGGCTGCCCTGTGGCCCACGCCGCCTCGCACCCTGA 

CCTCGTCCCCTGTCTCTCCTCCCGCCTGCCCCTTGTG 

CAGAGAGCAGTCCCTGAGGTGGTCGGAGCGTGGGGACTC 

ACGCCTGGTGGGTGGCTTTCGGCCCTGTGCTGTCTCCAC 

CACCCCCA 



Fig. 2B 
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exon 4 

GGTGGTTCTGGTGTCCCAGATGCCCCACGTGGCCACTCC 

AGGGGCCTCCTGCACCCCAGCATTTCCCTTCATGGGCT 

CTTTGCTGTGAGGCCCAGCTGGGGCCAAGGGAGGATG 

GGCCAGCCACGTCCAGCCTCTGACACTAGTGTCCCTTCG 

CCTTGCAG 

GGTGCGGCGGTGATGATGGAGAATAAGCCCATGACCCTG 
AAGCTCATCATGACCTTGGCATTCACCACCCTCGGCGAA 
CGTGCCTTCATGAACCGCACTGTGGGTGAGATCATGTGG 
GGCTACAAGGACCCCTTGTGAATCTCATCAACAAGTACT 
TTCCAGGCATGTTCCCCTTCAAGGACAAGTTCGGATTAT 
TTGCTGAG 

GTACGTGTGGCCTGGTGAGAAGCCAAAGATTCAGGCCTG 

TGTCCTGTCTTCCCCTCACACAGCCTGGACACTGGTC 

ACCAGCTTGCTTTGTAGCTGGCTGGGGATCTAGTGGCTG 

TGGGTTGTAAGTGACTGAGAACCTGACTCAAACCGGCTT 

GAGTGAAA 



exon 5 

CCTCTCGGTCCCCAGACACTGGGCATTTGGCAGTGAACC 

AGATGCTGGGGGCCCTGTCCTTCTGGTGGAGGGGGAGGA 

GGGCTCAGCCCAGAATGTTCAGACCAGGCCGGCTCAA 

TGGCAGGCCTAAGCCTTACGATGCTGTTCCCTGCTGTGT 

CTGTAG 

CTCAACAACTCCGACTCTGGGCTCTTCACGGTGTTCACG 
GGGGTCCAGAACATCAGCAGGATCCACCTCGTGGACAAG 
TGGAACGGGCTGAGCAAG 

GTGAGGGGCGAGAGGCGAGGGCCCCTGTCGCCAGGGAGA 

GGGGAGGGTGGGCCTGGCCATGGCTGCTCGGGAGTGGCA 

GGGACCAGAGAGCTCCTTCTTCCTTTGTCGTGAAGAG 

GGTGCTGGGAGGATGAACACTCTTGAAGTTGGAGGAGGG 

ATTTTA 



Fig. 2C 
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exon 6 

TCTCTGTGTGTCTACATAGCCTGCCCTCTTCCCACCGTG 

CCAGTATTGGGAATTGAGTGGCCGTGCGTGCACCAGGGT 

GAGTTAGGTGTGCAGCACCTGAGAGGGCTTATTAAGG 

GGCCTTGGCCCTACTGAGGGGTCTAGTCTGGATGCTTCC 

CCCCAG 

GTTGACTTCTGGCATTCCGATCAGTGCAACATGATCAAT 
GGAACTTCTGGGCAAATGTGGCCGCCCTTCATGACTCCT 
GAGTCCTCGCTGGAGTTCTACAGCCCGGAGGCCTGCCG 

GTAATCACTGGGACTCGGGGCCTCCTGGGTTTCCTGGGT 

AGCTCATGGCCAAATTCTGTGGTGTTGGCTGTGCACTT 

GGAAAGCATTTTGACTCATCGTGGATTTGACTCAGTAG 

CCCTTGGCACCAGCTTGAATTCTCTTTGGTCACACCACC 

AAAAGC 



exon 7 

GGAGGTCGCTGCAGCTCCGCGGGTGAGAGATGGGGGCGG 

TTTGGACCCGGGAGGTGGTAGCGCCCGTGGGGAGAAGTG 

GCTGGATCTGGGCAGCCTTTGGCAGGGCCTGGCTCTGGC 

CGCCGGGTCTGGGTGTCCCCTCTCATCCTGTCTGTCC 

CCTGCAG 

ATCCATGAAGCTAATGTACAAGGAGTCAGGGGTGTTTGA 
AGGCATCCCCACCTATCGCTTCGTGGCTCCCAAAACCCT 
GTTTGCCAACGGGTCCATCTACCCACCCAACGAAGGCTT 
CTGCCCGTGCCTGGAGTCTGGAATTCAGAACGTCAGCAC 
CTGCAGGTTCA 

GTACGTGCCGTCCCCTGTTCTGGGATNGCCGGAGGGTGT 
TAGGTNTNGGGCACCTNANGGTTTATCTGCCCAATGCTG 
TCTGCTTAATCTCTGGCCTCTGTACTCTTGATAACC 
CATTAAGCCAAAAATATGATGCCTCTGGGACGATATCTG 



Fig. 2D 
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exon 8 



TGGGGCTTTTTACAGAATGGAGGAAGGGATCCTCTCT 

GTCGGGTATTATGGTCATCGCCACGGGGGTGCCGTGCAG 
ACCACAGCTCTGTGCAGACTTCCGGAGTGGCAGGACGTG 
CCAATATACTGTCGTTGTATGATGTCCCCTCCCTGCCCT 
TGTTGTAG 

GTGCCCCCTTGTTTCTCTCCCATCCTCACTTCCTCAACG 
CCGACCCGG'TTCTGGCAGAAGCGGTGACTGGCCTGCACC 
CTAACCAGGAGGCACACTCCTTGTTCCTGGACATCCACC 
CG 

GTGAGCCCCTGCCATCCTCTGTGGGGGGTGGGTGATTCC 
TGGTTGGAGCACACCTGGCTGCCTCCTCTCTCCCCAG 
GCAGAGAGCTGCTGTGGGCTGGGGTGGTGGGAAGCCTGG 
CTTCTAGAATCTCGAGCCACCAAAGTTCCTTACT 



exon 9 

CCCCAGCCTGTGGCTTGTTTTAGGTAAGATACAAGCAAG 

CTCCACTGGGCAGTTAGCTGGGACGCCCACCCTCTTGAC 

TGGGACCAGGGAAAAGAAGGTTGACTGTGTCCCTGGA 

GCTTGGGGGTGGCCAGTCTCCTCACTGTGTTTGTTGCCG 

CAG 

GTCACGGGAATCCCCATGAACTGCTCTGTGAAACTGCAG 
CTGAGCCTCTACATGAAATCTGTCGCAGGCATTGG 

GTGAGTGGGGACTGGGAACTGGGGCTGCATTGCTCATTG 

AGAGATTANGTGCTCAGTGCTCCAGTGTTCCCAGAC 

TCCCCTGACATACCCCAGGAAACAGGGCATGGGGAAGGG 

AGAGGGTCCTATTGGGGGTGGAATCCAGTCCCTGCTGAT 

CTTCTC 

Fig. 2E 
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exon 10 

ATGGCTCCTAAAGTGTTTCAGCTCATTGTTTATATTTGG 

TGGTGAGGGTTTAGTGTGTGCAAAATTATACTAAACC 

TGTTTAGATGTTGTATTCAAGCAGAATTAGATCAAGTTT 

GGGTGTAAGACTTTGTTCCAACACCTATGTCTTGCTTAT 

TTCCAG 

ACAAACTGGGAAGATTGAGCCTGTGGTCCTGCCGCTGCT 
CTGGTTTGCAGAG 

GTAAGGGTGCGTTGGGCACAGCGTCGGGGGCTTTTGTTA 

ATAGCCAATGTGGGCATTTGAGGCAGGAGGCGGGGGG 

AGCACCTTGTAGAAAGGGAGAGGGCTGAGCCAGGGTAAC 

CGGACTGTTACATGGACCAGCGTATCATACACTTCACCC 

TGTC 



exon 11 

CCTGGAGGGAGGAGGTCCCTGGCAGGCTCCAACACATGC 

TTTAGCCGGGAAGCTTGAGGTGGGGAAAAGCTGAGGCGG 

GCACAGAGGAAGGTGTTGGGTGGCATCTGCGCTGTAG 

CCCGCAGCGTGCGGCCCCAGCTCATGTGTTTGTCATTCT 

GTCTCCTCAG 

AGCGGGGCCATGGAGGGGGAGACTCTTCACACATTCTAC 
ACTCAGCTGGTGTTGATGCCCAAGGTGATGCACTATGCC 
CAGTACGTCCTCCTGGCGCTGGGCTGCGTCCTGCTGCTG 
GTCCCTGTCATCTGCCAAATCCGGAGCCAA 

GTAGGTGCTGGCCAGAGGGCAGCCCGGGCTGACAGCCAT 

TCGCTTGCCTGCTGGGGGAAAGGGGCCTCAGATCGGACC 

CTCTGGCCAACCGCAGCCTGGAGCCCACCTCCAGCAG 

CAGTCCTGCGTCTCTGCCGGAGTGGGAGCGGTCACTGCT 

GGGGG 



Fig. 2F 
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exon 12 

CCCCACATCTCAGCCACCTGCAATCGTTGAGGGTTGTTG 

GACTCTAAACTTATGTGCCTTTCCTGTTTCCTCTTTGCC 

TTTTGCAAATTGAAGAACCGTGTAAAACCATTTTTAT 

GTGGCTTCAACGTCAACTATAAATTAGCTTGGTTATCTT 

CTAG 

GAGAAATGCTATTTATTTTGGAGTAGTAGTAAAAAGGGC 
TCAAAGGATAAGGAGGCCATTCAGGCCTATTCTGAATCC 
CTGATGACATCAGCTCCCAAGGGCTCTGTGCTGCAGGAA 
GCAAAACTGTAG 

GTGGGTACCAGGTAATGCCGTGCGCCTCCCCGCCCCCTC 

CCATATCAAGTAGAATGCTGGCGGCTTAAAACATTTGGG 

GTCCTGCTCATTCCTTCAGCCTCAACTTCACCTGGAG 

TGTCTACAGACTGAAGATGCATATTTGTGTATTTTGCTT 

TTGGAGAAA 



Fig. 2G 
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